Processing math: 71%
Research article Special Issues

A discrete extension of the Xgamma random variable: mathematical framework, estimation methods, simulation ranking, and applications to radiation biology and industrial engineering data

  • Count data modeling and its practical applications have garnered significant attention in recent research, owing to its relevance in a wide range of fields. This study specifically explores a novel discrete distribution characterized by two parameters, which is derived using the survival discretization method. The statistical properties of this distribution are thoroughly explained in closed forms, with several key mathematical attributes also derived. These characteristics underscore the distribution's effectiveness in modeling data that exhibit (right-skewed) asymmetry and have extended heavy tails, making it particularly suitable for such real-world applications. Furthermore, the failure rate function corresponding to this distribution is particularly appropriate for scenarios characterized by an increasing or bathtub-shaped failure rate over time. The model is also highly versatile, offering valuable insights into probabilistic modeling for datasets that display over dispersion, under dispersion, or equi dispersion. The study introduces several estimation techniques, including the maximum product of spacings, Anderson–Darling, right–tail Anderson–Darling, maximum likelihood estimation, least squares, weighted least squares, Cramer–Von–Mises, and percentile methods. Each of these methods is explained in detail, providing a comprehensive understanding of their application. A ranking simulation study is conducted to evaluate the performance of these estimators across varying sample sizes, using ranking techniques to identify the most effective estimator in different scenarios. The analysis of real-world datasets from biotechnology and industrial engineering further demonstrates the practical utility and relevance of the proposed model. The results highlight the model's ability to offer accurate and insightful analyses, reinforcing its significance in count data modeling and its wide-ranging applications.

    Citation: Mohamed S. Algolam, Mohamed S. Eliwa, Mohamed El-Dawoody, Mahmoud El-Morshedy. A discrete extension of the Xgamma random variable: mathematical framework, estimation methods, simulation ranking, and applications to radiation biology and industrial engineering data[J]. AIMS Mathematics, 2025, 10(3): 6069-6101. doi: 10.3934/math.2025277

    Related Papers:

    [1] Khaled M. Alqahtani, Mahmoud El-Morshedy, Hend S. Shahen, Mohamed S. Eliwa . A discrete extension of the Burr-Hatke distribution: Generalized hypergeometric functions, different inference techniques, simulation ranking with modeling and analysis of sustainable count data. AIMS Mathematics, 2024, 9(4): 9394-9418. doi: 10.3934/math.2024458
    [2] Wael W. Mohammed, Kalpasree Sharma, Partha Jyoti Hazarika, G. G. Hamedani, Mohamed S. Eliwa, Mahmoud El-Morshedy . Zero-inflated discrete Lindley distribution: Statistical and reliability properties, estimation techniques, and goodness-of-fit analysis. AIMS Mathematics, 2025, 10(5): 11382-11410. doi: 10.3934/math.2025518
    [3] Jia Chen, Renato De Leone . A survival tree for interval-censored failure time data. AIMS Mathematics, 2022, 7(10): 18099-18126. doi: 10.3934/math.2022996
    [4] M. G. M. Ghazal . Modified Chen distribution: Properties, estimation, and applications in reliability analysis. AIMS Mathematics, 2024, 9(12): 34906-34946. doi: 10.3934/math.20241662
    [5] Alanazi Talal Abdulrahman, Khudhayr A. Rashedi, Tariq S. Alshammari, Eslam Hussam, Amirah Saeed Alharthi, Ramlah H Albayyat . A new extension of the Rayleigh distribution: Methodology, classical, and Bayes estimation, with application to industrial data. AIMS Mathematics, 2025, 10(2): 3710-3733. doi: 10.3934/math.2025172
    [6] Emrah Altun, Mustafa Ç. Korkmaz, M. El-Morshedy, M. S. Eliwa . The extended gamma distribution with regression model and applications. AIMS Mathematics, 2021, 6(3): 2418-2439. doi: 10.3934/math.2021147
    [7] Heba S. Mohammed, Osama E. Abo-Kasem, Ahmed Elshahhat . Computational analysis of generalized progressive hybrid log-logistic model and its modeling for physics and engineering applications. AIMS Mathematics, 2025, 10(5): 10709-10739. doi: 10.3934/math.2025487
    [8] Refah Alotaibi, Hassan Okasha, Hoda Rezk, Abdullah M. Almarashi, Mazen Nassar . On a new flexible Lomax distribution: statistical properties and estimation procedures with applications to engineering and medical data. AIMS Mathematics, 2021, 6(12): 13976-13999. doi: 10.3934/math.2021808
    [9] Yin Xu, Ning Wang . Variable selection and estimation for accelerated failure time model via seamless-L0 penalty. AIMS Mathematics, 2023, 8(1): 1195-1207. doi: 10.3934/math.2023060
    [10] Refah Alotaibi, Mazen Nassar, Ahmed Elshahhat . Inference of exponentiated Teissier parameters from adaptive progressively type-II hybrid censored data. AIMS Mathematics, 2025, 10(5): 11556-11591. doi: 10.3934/math.2025525
  • Count data modeling and its practical applications have garnered significant attention in recent research, owing to its relevance in a wide range of fields. This study specifically explores a novel discrete distribution characterized by two parameters, which is derived using the survival discretization method. The statistical properties of this distribution are thoroughly explained in closed forms, with several key mathematical attributes also derived. These characteristics underscore the distribution's effectiveness in modeling data that exhibit (right-skewed) asymmetry and have extended heavy tails, making it particularly suitable for such real-world applications. Furthermore, the failure rate function corresponding to this distribution is particularly appropriate for scenarios characterized by an increasing or bathtub-shaped failure rate over time. The model is also highly versatile, offering valuable insights into probabilistic modeling for datasets that display over dispersion, under dispersion, or equi dispersion. The study introduces several estimation techniques, including the maximum product of spacings, Anderson–Darling, right–tail Anderson–Darling, maximum likelihood estimation, least squares, weighted least squares, Cramer–Von–Mises, and percentile methods. Each of these methods is explained in detail, providing a comprehensive understanding of their application. A ranking simulation study is conducted to evaluate the performance of these estimators across varying sample sizes, using ranking techniques to identify the most effective estimator in different scenarios. The analysis of real-world datasets from biotechnology and industrial engineering further demonstrates the practical utility and relevance of the proposed model. The results highlight the model's ability to offer accurate and insightful analyses, reinforcing its significance in count data modeling and its wide-ranging applications.



    Count data modeling is a fundamental aspect of statistical analysis, providing essential insights across a wide range of scientific fields. Count data consist of non–negative integers that reflect the frequency of events within a given unit of measurement, making them crucial in disciplines such as epidemiology, ecology, finance, social sciences, biotechnology, and engineering. These data help identify patterns and trends, enabling researchers to better understand various real–world phenomena. For example, in epidemiology, count data are used to track the number of disease cases within a population over time, aiding in the study of disease prevalence, transmission patterns, and the effectiveness of public health interventions. In ecology, they play a key role in assessing species' populations, biodiversity, and conservation efforts. By accurately modeling count data, researchers can make informed decisions and develop strategies to address complex challenges in different domains. In finance, counts such as trading volumes or the frequency of specific market events inform risk assessments and portfolio management. In the social sciences, count data play a crucial role in studies of survey responses [1] and voting patterns [2]. Similarly, in biotechnology and engineering, count data are utilized to analyze occurrences such as gene expression levels [3], manufacturing defects [4], or system failures [5], aiding in advancements and optimization within these fields [6].

    However, count data pose distinct challenges for analysis. Overdispersion is a common issue, where the observed variance exceeds the expected value under standard Poisson models, necessitating the use of more flexible models such as the negative binomial distribution. Zero–inflation, another challenge, occurs when datasets contain an excess of zero counts, which requires specific techniques to avoid bias. Additionally, count data may exhibit complex dependency structures, including spatial or temporal correlations, which complicates the development and selection of models.

    Given the fundamental importance of discrete probability models in capturing such patterns, we have developed and thoroughly investigated a new discrete probability distribution, building upon our previous survey work to address these specific challenges. This newly introduced model serves as the discrete counterpart to an extension of the xgamma (Exg) distribution. The Exg distribution has gained significant prominence across various domains, such as reliability analysis, survival modeling, and risk assessment, due to its flexibility and ability to capture a broad range of data patterns. It is particularly valued for its capacity to model right-skewed and heavy-tailed data, making it highly effective in representing diverse real-world phenomena. By integrating features from multiple distributions, the Exg model offers researchers a powerful tool for modeling complex datasets, facilitating a more profound understanding of the underlying mechanisms that drive observed events. For further details and an in-depth exploration of the Exg distribution, please refer to [7]. A random variable Z is said to follow the Exg distribution when its reliability function (RF), say R(z;ζ,τ), and probability density function (PDF), say g(z;ζ,τ), can be expressed as follows:

    R(z;ζ,τ)=ζτ2eτzζ+τ2η(z;ζ,τ); z>0, (1.1)

    and

    g(z;ζ,τ)=τ3ζ+τ2[1+ζz22]eτz; z>0, (1.2)

    respectively, where

    η(z;ζ,τ)=1ζ+1τ2+zτ+z22;ζ>0, τ>0.

    Using survival discretization techniques, a discrete analog of an extension to the xgamma (DExg) distribution can be derived. This technique is a statistical method designed to transform continuous probability distributions, like the Exg distribution, into discrete forms that are more applicable for real-world scenarios. Since real-world data are frequently recorded in discrete units or intervals, survival discretization is particularly useful. Through this process, the probability mass function (PMF) can be obtained as follows:

    Pr(Z=z;.)=R(z;.)R(z+1;.); z=0,1,2,3,.... (1.3)

    For a more detailed explanation of the survival discretization approach, refer to Roy [8]. Several discrete distributions based on the discrete survival function have been proposed and studied, including the discrete Burr–Hatke [9], discrete Pareto [10], discrete inverse Rayleigh [11], discrete exponentiated generalized family of distributions [12], a novel version of geometric distribution [13], discrete inverse Weibull [14], discrete Burr–XII [10], discrete Lindley [15], new discrete extended Weibull [16], new Poisson–weighted exponential [17], discrete generalized geometric [18], discrete Gompertz [19], discrete generalized exponential Type Ⅱ [20], discrete Ramos–Louzada [21], and the two-parameter discrete Lindley [22]. While numerous discrete probability models exist in the statistical literature, the growing complexity of daily generated data underscores the need for more flexible discrete distributions to effectively tackle these challenges. This is particularly crucial in cases that require closed-form expressions for statistical and reliability properties, as well as for addressing issues such as dispersion, heavy tails, irregular failure rates, outliers, and kurtosis, where traditional models often fall short. As a result, the DExg model has been proposed and thoroughly examined for its unique features, which allow it to capture a wide range of data behaviors and overcome the limitations of conventional models. The motivations for this can be summarized as follows:

    ● To introduce a discrete distribution with straightforward and closed-form expressions for essential properties, including the PMF, RF, moments, and related measures, facilitating efficient data modeling and fitting.

    ● To design a discrete model that effectively fits practical datasets, such as those arising in industrial or mortality studies, while accommodating bathtub–shaped or increasing failure rates and ensuring a non–decreasing failure rate function for realistic applications.

    ● To develop a flexible model that is capable of fitting diverse data types, including equi–dispersed, over–dispersed, and under–dispersed datasets, as well as distributions with positive skewness or leptokurtic characteristics.

    ● To propose a robust model specifically tailored for the analysis and fitting of count data, outperforming the existing discrete models commonly used in statistical applications.

    The structure of this article is organized as follows: Section 2 introduces the DExg distribution, developed using the survival discretization approach. Section 3 explores the statistical properties of the distribution in detail. Section 4 addresses the estimation of distribution parameters through various methodologies. In Section 5, a comprehensive simulation study is conducted, incorporating ranking techniques. Section 6 demonstrates the versatility of the DExg distribution through analyses of multiple datasets. Finally, Section 7 concludes with a summary of the key findings and outlines directions for future research.

    Applying Eqs (1.1) and (1.3), the cumulative distribution function (CDF) and its associated PMF are expressed as follows:

    F(z;ζ,τ)=1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ); zN0, (2.1)

    and

    Pr(Z=z;ζ,τ)=ζτ2ζ+τ2[η(z;ζ,τ)eτη(z+1;ζ,τ)]eτz; zN0, (2.2)

    respectively. The behavior of the PMF under various conditions is given as

    Pr(Z=z;ζ,τ)={(1eτ)eτz;ζ0(12τ2z2+τz+1)eτz(τ2[12z2+z+12]+τ[z+1]+1)eτ(z+1);ζ0;τ0[(τ2+ζ)(12ζτ2+ζτ+τ2+ζ)eτ][τ2+ζ]1;z0. (2.3)

    Figure 1 displays the PMF plots for different values of the parameter ζ and τ.

    Figure 1.  The PMF plots of the DExg model.

    It is important to highlight that the PMF is particularly effective for modeling data with a unimodal shape. Additionally, it demonstrates its versatility by effectively analyzing asymmetric, positively skewed, and heavy-tailed data, making it well-suited for capturing a wide range of data patterns. The hazard rate function (HRF) of the DExg model can be expressed as

    h(z;ζ,τ)=η(z;ζ,τ)eτη(z+1;ζ,τ)eτη(z;ζ,τ); zN0. (2.4)

    The reversed hazard rate function (RHRF) can be given as

    r(z;ζ,τ)=ζτ2[η(z;ζ,τ)eτη(z+1;ζ,τ)]eτz[ζ+τ2ζτ2eτ(z+1)η(z+1;ζ,τ)]; zN0. (2.5)

    The HRF represents the likelihood of an item's failure at a specific age z, and it forms part of the broader hazard function equation. This equation assesses the probability that an item, having survived up to a certain time t, will continue to last beyond that point. In essence, it quantifies the chance that an item surviving at one moment will persist to the next. The hazard rate is particularly relevant for non-repairable items and is often referred to as the failure rate. Its importance spans the design of secure systems in various fields, including commerce, engineering, finance, insurance, and regulatory industries. Mathematically, it is expressed as the ratio of the PMF to the corresponding RF. On the other hand, the RHRF for a random lifetime is defined as the ratio of the PMF to the CDF. This concept is crucial in analyzing censored data and finds applications in areas such as forensic science. Figures 2 and 3 shows the HRF and RHRF plots for different values of the parameters ζ and τ.

    Figure 2.  The HRF plots of the DExg distribution.
    Figure 3.  The RHRF plots of the DExg distribution.

    Consider a situation where we have a collection of n random variables, represented as Z1,Z2,...,Zn, arranged in non–decreasing order, such that Z1:nZ2:n ...Zn:n. In the realm of order statistics, it is essential to understand that there are no restrictions on whether these Xi values are independent or identically distributed. Nevertheless, many established results related to order statistics are derived on the basis of the classical assumption that the Xi values are independent and identically distributed. For the DExg distribution, the CDF of the ith order statistic can be expressed as

    Fi:n(z;ζ,τ)=nk=i(nk)[Fi(z;ζ,τ)]k[1Fi(z;ζ,τ)]nk=nk=inkj=0Φ(n,k)m[1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)]k+j,    

    where Φ(n,k)m=(1)j(nk)(nkj). Moreover, the associated PMF of the ith order statistic is given by

    fi:n(z;ζ,τ)=Fi:n(z;ζ,τ)Fi:n(z1;ζ,τ)=nk=inkj=0Θ(n,k)(m,j)(ζ,τ)[[η(z;ζ,τ)eτη(z+1;ζ,τ)]eτz]k+j, (3.1)

    where

    Θ(n,k)(m,j)(ζ,τ)=Φ(n,k)m(ζτ2ζ+τ2)k+j.

    Thus, the rth moments of Zi:n can be expressed as

    μ(r)i:n=E(Zri:n)=z=0nk=inkj=0zr Θ(n,k)(m,j)(ζ,τ)[[η(z;ζ,τ)eτη(z+1;ζ,τ)]eτz]k+j. (3.2)

    Moreover, since the support of the DExg distribution is N0, therefore, μ(1)1:n, μ(1)n:n, μ(2)1:n, and μ(2)n:n can be directily derived utilizing Fi:n(z;ζ,τ) (see Arnold et al. [23]) as follows

    μ(1)1:n=z=0[ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)]n,μ(1)n:n=z=0[1{1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)}n],μ(2)1:n=2z=0z[ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)]n+μ(1)1:n,μ(2)n:n=2z=0z[1{1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)}n]+μ(1)n:n,

    where

    η(z+1;ζ,τ)=1ζ+1τ2+z+1τ+(z+1)22;ζ>0, τ>0.

    According to μ(1)1:n, μ(1)n:n, μ(2)1:n, and μ(2)n:n, σ21,1:n and σ2n,n:n can be reported. The numerical results provide insights into the behavior of the order statistics' expectations and variances for different values of n and τ. As n increases, μ(1)1:n decreases, indicating that the smallest order statistic shifts towards lower values. In contrast, μ(1)n:n remains close to 49, suggesting that the maximum order statistic stabilizes at an upper bound. The second moments, μ(2)1:n and μ(2)n:n, follow similar trends, with the former decreasing and the latter maintaining a high magnitude. The variances, σ21,1:n and σ2n,n:n, exhibit an interesting pattern: σ21,1:n is very small, implying that the smallest order statistic is highly concentrated, while σ2n,n:n is relatively large but decreases slightly as n grows, signifying increased stability in the maximum order statistic. These results highlight the impact of the sample size and distribution parameters on the dispersion and expectation of extreme order statistics.

    In probability theory and statistical analysis, several key functions and measures are employed to characterize the behavior of random variables. The moment generating function (MGF), defined as MZ(t;.)=E(etZ), serves to encapsulate all moments of a distribution, enabling the extraction of higher-order moments through differentiation with respect to t at t=0. This function is instrumental in deriving essential quantities like the expectation and variance, which provide insight into the central tendency and dispersion of a random variable, respectively. Moreover, skewness and kurtosis, which are higher-order moments obtained from the MGF, characterize the asymmetry and the tail behavior (or peakedness) of the distribution.

    In the case of discrete random variables, the probability generating function (PGF), defined as GZ(s;.)=E(sZ) (where s is a real number) offers a concise representation of the PMF and facilitates the derivation of probabilities and moments. The PGF is particularly valuable for studying integer-valued random variables, focusing on the distribution of outcomes. For the DExg distribution, both the MGF and PGF can be expressed in explicit forms, though their expressions may be intricate. Special cases, such as when the parameter ζ approaches zero or infinity, yield simplified representations of these functions, offering valuable insights into the underlying probabilistic structure. The MGF of the DExg distribution can be formulated as

    MZ(t;ζ,τ)={1eτeteτ;ζ0Υ(t;τ)2(e3t3eτ+2t+3e2τ+te3τ);ζ,

    where

    Υ(t;τ)=τ2(eτ+2te2τ+t+eτ+t+e2τ)+2τ(eτ+2te2τ+teτ+t+e2τ)+2(2e2τ+teτ+2te3τ+e2t2eτ+t+e2τ).

    The PGF can be derived from the MGF by substituting s=et. Furthermore, the first four moments of the DExg distribution can be proposed as

    E(Z)={1eτ1;ζ0τ2(eτ+1)eτ+2τ(eτ1)eτ+2(e2τeτ+1)2(e3t3e2τ+3eτ1);ζ,
    E(Z2)={eτ+1e2τ2eτ+1;ζ0τ2(e2τ+8eτ+3)eτ+2τ(e2τ+2eτ3)eτ+2(e3τe2τeτ+1)2(e4τ4e3τ+6e2t4eτ+1);ζ,
    E(Z3)={e2τ+4eτ+1e3t3e2τ+3eτ1;ζ0τ2(e3τ+23e2τ+41eτ+7)eτ+2τ(e3τ+9e2τ3eτ7)eτ+2(e4τ+2e3τ6e2τ+2eτ+1)2(e5τ5e4τ+10e3τ10e2t+5eτ1);ζ,
    E(Z4)={e3τ+11e2τ+11eτ+1e4τ4e3τ+6e2t4eτ+1;ζ0τ2(e4τ+54e3τ+240e2τ+170eτ+15)eτ+2τ(e4τ+24e3τ+30e2τ40eτ15)eτ+2(e5τ+9e4τ10e3τ10e2τ+9eτ+1)2(e6τ6e5τ+15e4τ20e3τ+15e2τ6et+1);ζ.

    Table 1 presents various statistical measures for different combinations of the parameters ζ and τ.

    Table 1.  Numerical descriptors for characterizing the DExg distribution.
    Measure ζτ 0.1 0.3 0.5 10.0 20.0 30.0
    0.05 26.144 5.230 2.201 47×105 2.1×109 9.6×1014
    E(Z) 0.5 28.980 8.486 4.180 58×105 2.6×109 1.2×1013
    1.5 29.402 9.123 4.934 85×105 3.7×109 1.7×1013
    0.05 323.296 29.152 7.455 47×105 2.1×109 9.5×1014
    Variance(Z) 0.5 310.903 35.704 12.844 58×105 2.6×109 1.1×1013
    1.5 297.979 34.506 12.837 85×105 3.7×109 1.6×1013
    0.05 12.365 5.573 3.386 1.000 1.000 0.99999
    Variability index(Z) 0.5 10.728 4.207 3.072 1.000 1.000 0.99999
    1.5 10.134 3.781 2.601 1.000 1.000 0.99999
    0.05 1.077 1.579 1.991 146.244 21729.74 3.2×106
    Skewness(Z) 0.5 1.084 1.082 1.179 130.254 19509.592 2.9×106
    1.5 1.261 1.103 1.084 108.062 16318.50 2.4×106
    0.05 4.694 6.222 8.450 21392.61 4.7×108 1.0×1013
    Kurtosis(Z) 0.5 4.7304 4.744 4.854 16972.09 3.8×108 8.4×1012
    1.5 4.580 4.863 4.747 11683.34 2.6×108 5.9×1012

     | Show Table
    DownLoad: CSV

    As τ increases, the expected value and variance of Z decrease significantly, with higher values of τ leading to near-zero values, indicating that the distribution becomes more concentrated around its mean. The variability index decreases similarly, stabilizing near 1 for a large τ, reflecting reduced relative dispersion. Conversely, skewness and kurtosis grow exponentially with increasing values of τ, showing that the distribution becomes highly asymmetric with heavy tails and large outliers at higher τ values. Emphasizing the pivotal role that ζ and τ play in shaping the distribution's central tendency, dispersion, and overall form, the DExg distribution proves particularly useful for modeling data that are over-dispersed, under-dispersed, or equi-dispersed. This distribution is especially effective in handling right-skewed data with a leptokurtic shape, making it ideal for discussing complex real-world scenarios where skewness and high kurtosis are prevalent.

    Entropy is a crucial measure in information theory and statistical modeling, quantifying the uncertainty or randomness of a probability distribution. In the context of the DExg distribution, entropy can be particularly useful for evaluating the complexity of data generated by the model, assessing the model selection criteria, and optimizing information-based decision-making processes. For example, in reliability analysis, entropy helps quantify the unpredictability of failure times, while in actuarial sciences, it aids in risk assessments. For a discrete random variable Z that takes values from the set Ω, and is distributed according to p: Ω[0,1], the residual entropy of the variable Z and its cumulative expression can be listed, respectively, as follows:

    I(z;.)=z=0F(z;.)log(F(z;.)),

    and

    CI(z;.)=z=0(1F(z;.))log(1F(z;.)).

    By utilizing algebraic techniques (the Taylor series for log(1W);|W|<1) with respect to the DExg distribution, we can derive

    I(Z)=z=0[1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ))]log[1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)]=z=0n=11n[1ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ))][ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)]n,

    and

    CI(Z)=ζτ2ζ+τ2z=0[eτ(z+1)η(z+1;ζ,τ))]log[ζτ2eτ(z+1)ζ+τ2η(z+1;ζ,τ)].

    Algebraic methods, particularly the Taylor series expansion for log(1W) with |W|<1, are applied to streamline the mathematical analysis of the DExg distribution. This approach provides several advantages, including analytical simplicity by making complex expressions more manageable, improved numerical computation through infinite series approximations, and stable convergence properties that ensure the validity of the derived results. Additionally, the Taylor series expansion facilitates the generalization and approximation of key statistical measures, such as entropy and information indices, enhancing interpretability and practical implementation. By leveraging these techniques, this study strengthens both the theoretical foundation and applied relevance of the DExg distribution in statistical modeling. The Rényi entropy of the random variable Z, derived using the generalized binomial expansion, is defined as

    IR(Z;ω)=11ωlog([ζτ2ζ+τ2]ωz=0[η(z;ζ,τ)eτη(z+1;ζ,τ)]ωeωτz), (3.3)

    where ω>0 and ω1. Shannon entropy can be computed as a specific instance of Rényi entropy when the parameter ω approaches 1. Table 2 presents numerical calculations for the entropies reported in this section.

    Table 2.  Entropies using the DExg distribution.
    Measure ζτ 0.1 0.3 0.5 0.7 0.9
    0.5 5.7992 1.9862 1.1384 0.7291 0.4937
    I(Z) 0.7 5.7825 1.9819 1.1651 0.7701 0.5344
    0.9 5.7726 1.9754 1.1768 0.7942 0.5621
    0.5 7.4406 2.5435 1.5304 1.0552 0.7714
    CI(Z) 0.7 7.4298 2.5294 1.5345 1.0773 0.8024
    0.9 7.4056 2.5193 1.5327 1.0870 0.8201
    0.5 0.2633 0.4499 0.6040 0.7688 0.9347
    IR(Z;1.5) 0.7 0.2637 0.4496 0.5909 0.7374 0.8898
    0.9 0.2639 0.4501 0.5850 0.7192 0.8601

     | Show Table
    DownLoad: CSV

    The data indicate that both τ and ζ influence these measures. Notably, I(Z) and CI(Z) generally decrease as τ increases for each ζ, with CI(Z) being consistently larger than I(Z). The effect of ζ becomes more noticeable at higher τ values. IR(Z;1.5) shows the opposite trend, increasing as τ increases, with minimal variation across ζ values for small values of τ but more divergence at higher τ values.

    The HRF plays a crucial role in reliability analysis, particularly for two-component systems arranged in parallel and series configurations. In a two-component parallel system, the system fails only when both components fail, leading to an overall hazard rate that reflects the redundancy and increased reliability provided by the parallel structure. Conversely, in a two-component series system, the system fails if either component fails, resulting in a higher overall hazard rate due to the system's vulnerability to single-point failures. Understanding the HRF for these configurations is essential for modeling and optimizing systems' reliability in various practical applications. Suppose that T1 and T2 are two independent DExg random variables with the parameters (ζ1,τ1) and (ζ2,τ2), respectively. Then, the HRF of W=min(T1,T2) can be formulated as

    hW(z;ξ)=Pr(min(T1,T2)=z)Pr(min(T1,T2)z)=Pr(min(T1,T2)z)Pr(min(T1,T2)z+1)Pr(min(T1,T2)z)=Pr(T1z)Pr(T2z)Pr(T1z+1)Pr(T2z+1)Pr(T1z)Pr(T2z),

    then

    hW(z;ξ)=η(z;ζ1,τ1)eτ1η(z+1;ζ1,τ1)eτ1η(z;ζ1,τ1)+η(z;ζ2,τ2)eτ2η(z+1;ζ2,τ2)eτ2η(z;ζ2,τ2)[η(z;ζ1,τ1)eτ1η(z+1;ζ1,τ1)][η(z;ζ2,τ2)eτ2η(z+1;ζ2,τ2)]η(z;ζ1,τ1)η(z;ζ2,τ2)e(τ1+τ2),

    where ξ=(ζ1,τ1,ζ2,τ2) and zN0. Since the HRF of T1 and T2 is increasing, then the HRF of W is also increasing. Similarly, the HRF of W=max(T1,T2) can be given as

    hW(z;ξ)=11{1ζ1τ21eτ1(z+1)ζ1+τ21η(z+1;ζ1,τ1)}{1ζ2τ22eτ2(z+1)ζ2+τ22η(z+1;ζ2,τ2)}1{1ζ1τ21eτ1(z+1)ζ1+τ21η(z;ζ1,τ1)}{1ζ2τ22eτ2(z+1)ζ2+τ22η(z;ζ2,τ2)}.

    The HRF of the random variable W is also increasing. On the other hand, the RHRF of W and W can be reported as

    rW(z;ξ)=11ζ1ζ2τ21τ22(ζ1+τ21)1(ζ2+τ22)1e(z+1)(τ1+τ2)η(z;ζ1,τ1)η(z;ζ2,τ2)1ζ1ζ2τ21τ22(ζ1+τ21)1(ζ2+τ22)1e(z+1)(τ1+τ2)η(z+1;ζ1,τ1)η(z+1;ζ2,τ2),

    and

    rW(z;ξ)=ζ1τ21[η(z;ζ1,τ1)eτ1η(z+1;ζ1,τ1)]eτ1z[ζ1+τ21ζ1τ21eτ1(z+1)η(z+1;ζ1,τ1)]+ζ2τ22[η(z;ζ2,τ2)eτ2η(z+1;ζ2,τ2)]eτ2z[ζ2+τ22ζ2τ22eτ2(z+1)η(z+1;ζ2,τ2)]{[ζ1τ21[η(z;ζ1,τ1)eτ1η(z+1;ζ1,τ1)]eτ1z][ζ2τ22[η(z;ζ2,τ2)eτ2η(z+1;ζ2,τ2)]eτ2z][ζ1+τ21ζ1τ21eτ1(z+1)η(z+1;ζ1,τ1)][ζ2+τ22ζ2τ22eτ2(z+1)η(z+1;ζ2,τ2)]}.

    By analyzing the HRF and RHRF, engineers and analysts can optimize a system's performance and plan maintenance strategies based on actual operational time before failure or a failure's occurrence. The HRF and RHRF of the DExg distribution play a crucial role in understanding systems' reliability, particularly in two-component parallel and series systems. The HRF is essential for evaluating series systems, where the failure of a single component results in the system's failure, making it a key tool for identifying weak links. The DExg distribution's flexibility in modeling bathtub-shaped and increasing failure rates makes it well-suited for systems prone to early-life failures or gradual wear-out. Conversely, the RHRF is particularly valuable for analyzing parallel systems, which fail only when all components fail. It provides insights into the expected operational lifetime of a system before failure, making it especially useful for components with extended lifetimes. The DExg model's ability to capture heavy-tailed behavior further enhances its applicability in reliability analysis. By accommodating various failure patterns, it aids in developing effective maintenance strategies and risk assessments, particularly in safety-critical fields such as electronics, mechanical systems, and industrial applications. Thus, the DExg distribution is a powerful tool for enhancing system's reliability in both series and parallel configurations.

    The mean active time (MAT) function for discrete random variables measures the expected remaining lifetime of a system or component, given that it has survived up to a certain point. It is a key tool in reliability analysis and survival studies, helping us to understand the future performance of a system after reaching a specific time. In contrast, the mean inactive time (MIT) function focuses on the average duration the system or component has already lasted before a certain time point. Both the MAT and MIT functions are crucial for assessing the lifetime behavior of discrete random variables, with applications in fields like actuarial science, medical studies, and industrial reliability testing. For complex models like the DExg distribution, both the MAT and MIT can be derived in explicit forms, although their expressions may be quite elaborate. Simplifications occur in special cases, such as when the parameter ζ approaches 0 or , providing clearer insights into the probabilistic behavior of the model. The MAT, say κ(i), for the DExg distribution can be represented as follows:

    κ(i)=E(Zi|Zi); i=0,1,2,3,...=1Pr(Zi;ζ,τ)z=i(zi)Pr(Z=z;ζ,τ)={1eτ1;ζ0τ2([i2+2i+1]e2τ[2i2+2i1]eτ+i2)+2τ([i+1]e2τ[2i+1]eτ+i)+2(e2τ2eτ+1)τ2(i2e3τ3i2e2τ+3i2eτi2)+2τ(ie3τ3ie2τ+3ieτi)+2(e3τ3e2τ+3eτ1);ζ.

    Similarly, the MIT, say κ(i), for the DExg model can be formulated as follows:

    κ(i)=E(iZ|Zi); κ(0)=0={(ieτ(i+1)[i+1]eiτ+1)eτ(eτ(i+1)1)(eτ1);ζ0A(i,τ)B(i,τ);ζ,

    where

    A(i,τ)=τ2([2i+i2+1]e3τeτ(i+3)[2i2+2i1]e2τeτ(i+2)+i2eτ)+2τ([i+1]e3τ[2i+1]e2τeτ(i+3)+eτ(i+2)+ieτ)+2(ieτ(i+4)[1+3i]eτ(i+3)+[2+3i]eτ(i+2)[1+i]eτ(i+1)+e3τ2e2τ+eτ),

    and

    B(i,τ)=τ2([i22i1]e3τ+3[2i+i2+1]e2τ3[i2+2i+1]eτ+i2+2i+1)+2τ([i+1][e3τ+3e2τ3eτ]+i)+2(eτ(i+4)3eτ(i+3)+3eτ(i+2)eτ(i+1)e3τ+3e2τ2eτ+1).

    Table 3 lists some numerical calculations for the MAT and MIT discussed in this section.

    Table 3.  The MAT and MIT utilizing the DExg distribution.
    Measure ζτ 0.1 0.3 0.5 0.7 0.9 1.5 5.0 10.0
    0.5 21.4064 4.7669 2.2653 1.3568 0.9057 0.3588 0.0081 0.00005
    MAT 0.7 21.4335 4.7783 2.2707 1.3599 0.9077 0.3595 0.0082 0.00005
    0.9 21.4486 4.7846 2.2738 1.3616 0.9088 0.3599 0.0082 0.00005
    0.5 3.1422 4.7246 6.3986 7.6900 8.5075 9.4929 9.9908 9.99993
    MIT 0.7 3.0431 4.5407 6.1623 7.4690 8.3339 9.4258 9.9899 9.99993
    0.9 2.9849 4.4279 6.0066 7.3113 8.2009 9.3672 9.9890 9.99993

     | Show Table
    DownLoad: CSV

    Table 3 illustrates how the two measures, MAT and MIT, change in response to variations in the parameters ζ and τ when utilizing the DExg distribution. The distribution demonstrates sensitivity to both parameters, with significant changes in behavior depending on their specific combination. For MAT, as τ increases while ζ remains constant, the values tend to decrease, indicating a quicker approach to equilibrium or progressively shorter absolute times approaching zero. Conversely, for MIT, with ζ held constant, increasing τ results in values that rise and eventually stabilize around 10, signifying a longer interarrival time.

    Consider a random sample Z1, Z2,,Zm drawn from the DExg model. The log-likelihood function (L) for the DExg distribution can be written as

    L(z_|ζ,τ)=nlog(ζτ2ζ+τ2)τmj=1zj+mj=1log[η(zj;ζ,τ)eτη(zj+1;ζ,τ)]. (4.1)

    Taking the derivative of the log-likelihood with respect to the parameters ζ and τ and equating it to zero, we get the normal equations in the following form

    nτ2ζ(ζ+τ2)1ζ2mj=11+eτη(zj;ζ,τ)eτη(zj+1;ζ,τ)=0,
    2nζτ(ζ+τ2)mj=1zj+mj=12τ3zjτ2[eτ(2τ3(zj+1)τ2)η(zj+1;ζ,τ)eτ]η(zj;ζ,τ)eτη(zj+1;ζ,τ)=0,

    respectively. An analytical solution for this equation cannot be obtained; thus, a numerical iterative method must be employed using R software. In our study, we estimate the parameters ζ and τ of the DExg distribution using the optim() function from the base stats package. The optimization is performed by defining the negative log-likelihood function and minimizing it to obtain the best-fitting parameter estimates. Given the flexibility and efficiency of optim(), it is a suitable choice for our model's parameter estimation. For more complex likelihood models, mle2() from bbmle or maxLik() from maxLik could provide alternative approaches with more optimization techniques and statistical diagnostics.

    Consider a random sample from the DExg model, with the order statistics Z(1),Z(2),,Z(m). The least-squares estimator (LSE) of the DExg parameters, denoted ˆζLS and ˆτLS, can be reported by solving the non-linear equation defined as follows:

    mj=1[1ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)jm+1]Δr(z(j)|ζ,τ)=0; r=1,2, (4.2)

    with respect to the parameters ζ and τ, where

    Δ1(z(j)|ζ,τ)=ζF(z(j)|ζ,τ)=τ2eτ(z(j)+1)(ζ+τ2)[1ζτ2η(z(j)+1;ζ,τ)(ζ+τ2)],Δ2(z(j)|ζ,τ)=τF(z(j)|ζ,τ)=ζτ2eτ(z(j)+1)(2τ3+(z(j)+1)τ2)ζ+τ2+η(z(j)+1;ζ,τ)×eτ(z(j)+1)[(ζ+τ2)ζ[τ2(z(j)+1)+2τ]+2ζτ3](ζ+τ2)2. (4.3)

    Note that the solution of Δr(z(j)|ζ,τ)=0; r=1,2 can be obtained numerically. The weighted LSE (WLSE), say ˆζWLS and ˆτWLS, can be derived by solving the non-linear equation defined by

    mj=1(m+1)2(m+2)j(mj+1)[1ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)jm+1]Δr(z(j)|ζ,τ)=0; r=1,2, (4.4)

    with respect to the parameters ζ and τ, where Δr(z(j)|ζ,τ) is defined in Eq (4.3). To estimate the parameters ζ and τ of theDExg model using LSE and WLSE, we rely on solving the nonlinear equations numerically. Several R packages provide robust tools for this task. The nleqslv package is particularly effective for solving nonlinear systems, allowing direct computation of the estimators by finding the roots of the equations. Similarly, the rootSolve package offers an alternative multivariate root-finding approach. When direct root-finding is challenging, the optim function from the stats package can be used to minimize the squared sum of the LSE or WLSE equations, transforming the problem into an optimization framework. These numerical techniques ensure the accurate and efficient estimation of ζ and τ for the DExg model. By incorporating weighted adjustments in WLSE, we account for heteroscedasticity and improve the estimations' reliability. These methods have been applied in our study to obtain the best-fitting parameter values, enhancing the model's applicability to real-world data analysis.

    Assume a random sample Z1, Z2,,Zm drawn from the DExg distribution. The Anderson-Darling estimator (ADE) is another type of minimum distance estimator. The ADE of the DExg parameters, say ˆζAD and ˆτAD, are derived by minimizing

    AD(ζ,τ)=m1mmj=1(2j1)[log{1ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)}+log{ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)}]. (4.5)

    Concerning the parameters ζ and τ, the model is subject to optimization, while the right–tail Anderson-Darling estimator (RADE) of the model parameter is achieved through minimization

    RAD(ζ,τ)=m22mj=1{1ζτ2eτ(z(j:m)+1)ζ+τ2η(z(j:m)+1;ζ,τ)}1mmj=1(2j1)[log{ζτ2eτ(z(m+1j:m)+1)ζ+τ2η(z(m+1j:m)+1;ζ,τ)}], (4.6)

    with respect to the parameters ζ and τ. To solve the ADE and RADE for the DExg model in R, numerical optimization can be performed using packages such as optim(), nlminb(), DEoptim, or GenSA to minimize the respective objective functions with respect to the parameters ζ and τ. In our study, the optim() function is applied for the numerical optimization of the objective functions.

    In this section, we explore the estimation of the DExg parameters using the MPSE method with a complete sample. Consider a random sample Z1, Z2,,Zn drawn from the DExg distribution. For j=1,2,,m+1, let

    Wj(ζ,τ)=ζτ2eτ(z(j1)+1)ζ+τ2η(z(j1)+1;ζ,τ)ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ),

    be the uniform spacings of a random sample from the DExg distribution, where F(z(0)|ζ,τ)=0, F(z(m+1)|ζ,τ)=1, and m+1j=1Wj(ζ,τ)=1. The MPSE of ζ and τ, say ˆζMPS and ˆτMPS, can be derived by maximizing the geometric mean of the spacings

    V(ζ,τ)=[m+1j=1{ζτ2eτ(z(j1)+1)ζ+τ2η(z(j1)+1;ζ,τ)ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)}]1m+1, (4.7)

    with respect to the parameters ζ and τ. In R, the MPSE can be computed using multiple approaches, although there is no specific built-in function for this purpose. Instead, optimization functions such as optim(), nlm(), or nlminb() can be utilized to determine the MPSE by maximizing the product of the spacings, which is equivalent to minimizing the negative logarithm of the spacings' product. In our study, we employed the optim() function to estimate the parameters ζ and τ of the DExg distribution. The optimization was performed by defining an appropriate objective function based on the negative log-product of spacings, allowing us to obtain accurate estimates for ζ and τ that had the best fit to the observed data.

    The CVME represents the difference between the estimated CDF and the empirical CDF. To estimate the CVME of the DExg parameters, one must solve the following non–linear equation:

    mj=1[1ζτ2eτ(z(j)+1)ζ+τ2η(z(j)+1;ζ,τ)2j12m]Δr(z(j)|ζ,τ)=0, (4.8)

    with respect to the parameters ζ and τ, where Δr(z(j)|ζ,τ) is defined in Equation (4.3). To solve this equation numerically in R, optimization techniques such as optim() or nlminb() can be applied, where the objective function is defined by the sum of the weighted differences between the estimated CDF and the empirical CDF, using the relevant expressions for Δr(z(j)|ζ,τ).

    Let wj=j/(m+1) be an unbiased estimator of F(z(j)|ζ,τ). Hence, the PCE of the parameter β, denoted by ˆζPC and ˆτPC, can be reported by minimizing

    P(ζ,τ)=mj=1(z(j)D(wj))2,

    with respect to the parameters ζ and τ, where D(wj)=F1(z(j)|ζ,τ) is the quantile function of the DExg model. To solve for ζ and τ, we can use numerical optimization in R, specifically utilizing packages like optim() or nlminb(), to minimize the sum of squared differences between the observed data z(j) and the predicted quantiles D(wj).

    Simulation ranking techniques are a powerful tool for evaluating and comparing the performance of different statistical estimators or models. The process begins by generating a large number of simulated datasets, typically using known parameter values or distributions, which allows researchers to test estimators under controlled conditions. Various estimation methods, such as MPSE, ADE, MLE, LSE, RADE, PCE, CVME, and WLSE, are applied to these datasets to estimate the model parameters. The performance of each estimator is then evaluated by calculating error metrics like bias (the difference between the true and estimated parameters), mean squared error (MSE), and mean relative error (MRE), which quantify how accurately the estimators recover the true values. Once these error metrics are computed, the estimators are ranked according to their performance, with the estimator showing the smallest error metrics being considered the best. The process is typically repeated across a large number of simulation replications (e.g., 10,000 or more samples) to account for variability and ensure that the results are not due to random fluctuations. Simulation ranking techniques are particularly useful when dealing with complex models or when analytical solutions for comparing estimators are not feasible. They provide robust and comprehensive insights into the strengths and weaknesses of different estimation methods under various conditions, such as varying sample sizes, different parameter settings, or specific data characteristics.

    In this section, we evaluate the effectiveness of MPSE, ADE, MLE, LSE, RADE, PCE, CVME, and WLSE in relation to the sample size n using R software with DExg parameters, where a random variable Z is generated from the DExg distribution by discretizing a value X drawn from a continuous distribution, followed by Markov Chain Monte Carlo (MCMC) simulations, with a simulation study involving the generation of N=10,000 samples across four schemes with different parameter settings ζ and τ (Scheme Ⅰ: ζ=0.5, τ=0.2; Scheme Ⅱ: ζ=0.2, τ=0.4; Scheme Ⅲ: ζ=0.4, τ=0.2; Scheme Ⅳ: ζ=0.2, τ=0.5) with varying sample sizes (n1=20,  n2=50,  n3=150,  n4=300), and the computation of the estimators and error metrics such as bias, MSE, MRE for these samples, where

    |Bias(ϵ)|=1NNk=1|^ϵkϵk|,  MSE(ϵ)=1NNk=1(^ϵkϵk)2, MRE(ϵ)=1NNk=1|^ϵkϵk|ϵk .

    The MSE measures the average squared difference between the predicted and actual values, with a lower MSE indicating that predictions are closer to the actual values. In contrast, the MRE represents the average relative difference as a percentage, providing insights into accuracy and normalization across different data magnitudes. While the MSE emphasizes precision by squaring errors, the MRE considers the relative size of the errors. The MSE can be sensitive to outliers, whereas the MRE may be less affected when expressed in percentage terms. Although the MSE is less interpretable due to its squared units, the MRE offers a standardized measure of error as a percentage. The choice between MSE and MRE depends on the characteristics of the data and whether the focus is on precision or accuracy in the predictions. The empirical results from the simulation are presented in Tables 48 and Figures 47.

    Table 4.  Simulation results of Scheme Ⅰ.
    n Est Par ML WLS PC RAD MPS AD CVM OLS
    20 |Bias| ˆζ 0.16861 0.19334 0.21306 0.21627 0.19425 0.22698 0.17222 0.18163
    ˆτ 0.29712 0.29783 0.35376 0.36607 0.30264 0.44308 0.29421 0.34675
    MSE ˆζ 0.02801 0.03694 0.04476 0.04617 0.03725 0.02922 0.05088 0.03253
    ˆτ 0.08712 0.08753 0.12346 0.13217 0.09034 0.19378 0.08541 0.11865
    MRE ˆζ 0.33731 0.38674 0.42606 0.43257 0.38855 0.45398 0.34442 0.36323
    ˆτ 0.19812 0.19853 0.23586 0.24407 0.20174 0.29538 0.19611 0.23115
    Sum of ranks 91 213 366 427.5 275 427.5 152 244
    50 |Bias| ˆζ 0.10701 0.11834 0.13106 0.13127 0.11985 0.15188 0.11452 0.11723
    ˆτ 0.18571 0.18952 0.21135 0.22077 0.18983 0.33798 0.19154 0.21316
    MSE ˆζ 0.01131 0.01383 0.01686 0.01697 0.01414 0.02278 0.01292 0.01575
    ˆτ 0.03401 0.03542 0.04416 0.04807 0.03553 0.11278 0.03624 0.04505
    MRE ˆζ 0.21411 0.23674 0.26216 0.26247 0.23965 0.30368 0.22912 0.23453
    ˆτ 0.12381 0.12632 0.14095 0.14717 0.12653 0.22538 0.12774 0.14246
    Sum of ranks 61 172 346 427 234 488 183 275
    150 |Bias| ˆζ 0.05951 0.06413 0.07587 0.07486 0.06654 0.08788 0.06242 0.06755
    ˆτ 0.10211 0.10912 0.12386 0.12285 0.11564 0.21608 0.11083 0.12787
    MSE ˆζ 0.00351 0.00403 0.00567 0.00556 0.00434 0.00768 0.00382 0.00455
    ˆτ 0.01021 0.01172 0.01516 0.01495 0.01324 0.04608 0.01213 0.01617
    MRE ˆζ 0.11911 0.12833 0.15177 0.14976 0.13304 0.17578 0.12462 0.13515
    ˆτ 0.06801 0.07272 0.08256 0.08195 0.07714 0.14408 0.07393 0.08537
    Sum of ranks 61 152.5 397 335 244 488 152.5 366
    300 |Bias| ˆζ 0.04171 0.04462 0.05016 0.05067 0.04825 0.06438 0.04704 0.04643
    ˆτ 0.07261 0.07272 0.08295 0.08526 0.07724 0.16308 0.07663 0.08917
    MSE ˆζ 0.00171 0.00192 0.00246 0.00257 0.00225 0.00408 0.00203 0.00214
    ˆτ 0.00521 0.00532 0.00685 0.00716 0.00594 0.02628 0.00573 0.00787
    MRE ˆζ 0.08341 0.08912 0.10026 0.10127 0.09645 0.12868 0.09404 0.09283
    ˆτ 0.04851 0.04912 0.05535 0.05686 0.05154 0.10878 0.05103 0.05947
    Sum of ranks 61 122 336 397 274 488 203 315

     | Show Table
    DownLoad: CSV
    Table 5.  Simulation results of Scheme Ⅱ.
    n Est Par ML WLS PC RAD MPS AD CVM OLS
    20 |Bias| ˆζ 0.13901 0.15365 0.16277 0.16226 0.14804 0.17898 0.13992 0.14253
    ˆτ 0.67492 0.67693 0.74746 0.72825 0.70204 0.93898 0.66211 0.78067
    MSE ˆζ 0.01901 0.02335 0.02617 0.02596 0.02164 0.03168 0.01932 0.02083
    ˆτ 0.44952 0.45223 0.55126 0.52335 0.48634 0.86988 0.43251 0.60137
    MRE ˆζ 0.27781 0.30735 0.32547 0.32446 0.29604 0.35798 0.27992 0.28503
    ˆτ 0.22492 0.22563 0.24916 0.24275 0.23404 0.31298 0.22071 0.26027
    Sum of ranks 91.5 243.5 397 336 243.5 488 91.5 305
    50 |Bias| ˆζ 0.08181 0.09464 0.09966 0.10277 0.09002 0.12358 0.09103 0.09645
    ˆτ 0.40251 0.43604 0.47976 0.46915 0.42472 0.65648 0.42763 0.48847
    MSE ˆζ 0.00661 0.00884 0.00986 0.01047 0.00802 0.01508 0.00813 0.00915
    ˆτ 0.15991 0.18754 0.22716 0.21715 0.17802 0.42528 0.18043 0.23547
    MRE ˆζ 0.16371 0.18934 0.19936 0.20547 0.18012 0.24688 0.18203 0.19295
    ˆτ 0.13411 0.14534 0.15996 0.15645 0.14152 0.21888 0.14253 0.16287
    Sum of ranks 61 244 366 366 122 488 183 366
    150 |Bias| ˆζ 0.05031 0.05042 0.05877 0.05806 0.05303 0.07148 0.05364 0.05615
    ˆτ 0.22841 0.23612 0.26955 0.27536 0.25394 0.42798 0.25193 0.27707
    MSE ˆζ 0.00241 0.00252 0.00337 0.00326 0.00273 0.00508 0.00284 0.00315
    ˆτ 0.05141 0.05512 0.07175 0.07486 0.06364 0.18078 0.06263 0.07577
    MRE ˆζ 0.10051 0.10082 0.11737 0.11616 0.10603 0.14298 0.10734 0.11235
    ˆτ 0.07611 0.07872 0.08984 0.09176 0.08465 0.14268 0.08393 0.09237
    Sum of ranks 61 122 355 366.5 224 488 213 366.5
    300 |Bias| ˆζ 0.03401 0.03613 0.04126 0.04177 0.03714 0.05368 0.03512 0.03985
    ˆτ 0.16411 0.16913 0.19306 0.18805 0.17004 0.31958 0.16862 0.20757
    MSE ˆζ 0.00111 0.00133 0.00166 0.00187 0.00144 0.00288 0.00122 0.00155
    ˆτ 0.02651 0.02823 0.03676 0.03485 0.02854 0.10078 0.02802 0.04257
    MRE ˆζ 0.06801 0.07223 0.08246 0.08357 0.07424 0.10748 0.07032 0.07975
    ˆτ 0.05511 0.05633 0.06436 0.06265 0.05674 0.10658 0.05622 0.06917
    Sum of ranks 61 183 366 366 244 488 122 366

     | Show Table
    DownLoad: CSV
    Table 6.  Simulation results of Scheme Ⅲ.
    n Est Par ML WLS PC RAD MPS AD CVM OLS
    20 |Bias| ˆζ 0.32921 0.38575 0.42816 0.40408 0.37193 0.42867 0.34222 0.38204
    ˆτ 0.76233 0.76202 0.87435 0.88467 0.76904 0.98428 0.75941 0.88356
    MSE ˆζ 0.10691 0.14685 0.18087 0.16106 0.13643 0.18128 0.11552 0.14404
    ˆτ 0.57313 0.57292 0.75427 0.77236 0.58354 0.95598 0.56911 0.77025
    MRE ˆζ 0.21941 0.25715 0.28547 0.26936 0.24793 0.28578 0.22812 0.25464
    ˆτ 0.25403 0.25392 0.29145 0.29487 0.25634 0.32808 0.25311 0.29456
    Sum of ranks 122 213.5 376 407 213.5 478 91 295
    50 |Bias| ˆζ 0.20851 0.21433 0.24537 0.24446 0.21814 0.28898 0.21312 0.23355
    ˆτ 0.44611 0.47262 0.53465 0.54316 0.49414 0.69508 0.47383 0.54367
    MSE ˆζ 0.04291 0.04533 0.05947 0.05896 0.04694 0.08248 0.04482 0.05385
    ˆτ 0.19641 0.22042 0.28205 0.29106 0.24094 0.47678 0.22153 0.29157
    MRE ˆζ 0.13901 0.14283 0.16357 0.16296 0.14544 0.19268 0.14202 0.15565
    ˆτ 0.14871 0.15752 0.17825 0.18106 0.16474 0.23168 0.15793 0.18127
    Sum of ranks 61 152.5 366 366 244 488 152.5 366
    150 |Bias| ˆζ 0.11671 0.12512 0.13246 0.13747 0.12663 0.17158 0.12824 0.12945
    ˆτ 0.25121 0.26812 0.31126 0.30185 0.27613 0.44728 0.27944 0.31517
    MSE ˆζ 0.01341 0.01542 0.01736 0.01867 0.01593 0.02898 0.01624 0.01655
    ˆτ 0.06221 0.07092 0.09556 0.08995 0.07523 0.19738 0.07704 0.09797
    MRE ˆζ 0.07781 0.08332 0.08836 0.09167 0.08443 0.11418 0.08544 0.08625
    ˆτ 0.08371 0.08932 0.10376 0.10065 0.09203 0.14918 0.09314 0.10507
    Sum of ranks 61 122 366 366 183 488 244 366
    300 |Bias| ˆζ 0.08482 0.08341 0.09846 0.10077 0.08924 0.12448 0.08543 0.09445
    ˆτ 0.17901 0.18873 0.21595 0.22296 0.19674 0.31828 0.18782 0.23077
    MSE ˆζ 0.00712 0.00681 0.00956 0.01007 0.00784 0.02898 0.00723 0.00885
    ˆτ 0.03161 0.03513 0.04605 0.04906 0.03824 0.19738 0.03482 0.05257
    MRE ˆζ 0.05652 0.05561 0.06566 0.06717 0.05954 0.08298 0.05693 0.06295
    ˆτ 0.05961 0.06293 0.07195 0.07436 0.06554 0.10608 0.06262 0.07697
    Sum of ranks 91 122 335 397 245 488 154 366

     | Show Table
    DownLoad: CSV
    Table 7.  Simulation results of Scheme Ⅳ.
    n Est Par ML WLS PC RAD MPS AD CVM OLS
    20 |Bias| ˆζ 0.56051 0.62495 0.70537 0.69676 0.60984 0.73968 0.56232 0.58623
    ˆτ 0.09393 0.09372 0.10355 0.10627 0.09231 0.17458 0.09504 0.10536
    MSE ˆζ 0.31021 0.38536 0.03162 0.47907 0.36705 0.53988 0.31203 0.33914
    ˆτ 0.00873 0.00862 0.00985 0.01117 0.00841 0.03008 0.00894 0.01106
    MRE ˆζ 0.37381 0.41675 0.47027 0.46456 0.40654 0.49318 0.37482 0.39083
    ˆτ 0.18783 0.18742 0.20695 0.21257 0.18461 0.34908 0.19004 0.21066
    Sum of ranks 121 224 316 407 162 488 193 285
    50 |Bias| ˆζ 0.35141 0.38404 0.42166 0.42797 0.39755 0.50358 0.36592 0.37003
    ˆτ 0.05701 0.06043 0.06546 0.06295 0.06084 0.13288 0.05812 0.06787
    MSE ˆζ 0.12181 0.14554 0.16786 0.18077 0.15605 0.25018 0.13212 0.13513
    ˆτ 0.00311 0.00363 0.00446 0.00395 0.00374 0.01748 0.00332 0.00457
    MRE ˆζ 0.23421 0.25604 0.28106 0.28527 0.26505 0.33568 0.24392 0.24673
    ˆτ 0.11341 0.12093 0.13086 0.12605 0.12164 0.26568 0.11612 0.13577
    Sum of ranks 61 213 366.5 366.5 274 488 122 305
    150 |Bias| ˆζ 0.19671 0.20993 0.26217 0.24346 0.22575 0.31668 0.21564 0.20312
    ˆτ 0.03161 0.03252 0.03827 0.03524 0.03433 0.08568 0.03535 0.03656
    MSE ˆζ 0.03811 0.04343 0.17538 0.05846 0.05025 0.09897 0.04594 0.04072
    ˆτ 0.00091 0.00102 0.00427 0.00146 0.00124 0.00728 0.00113 0.00135
    MRE ˆζ 0.13111 0.14003 0.17477 0.16236 0.15055 0.21118 0.14374 0.13542
    ˆτ 0.06331 0.06512 0.07647 0.07044 0.06863 0.17148 0.07055 0.07316
    61 152 437 326 254.5 478 254.5 233
    300 |Bias| ˆζ 0.13671 0.14703 0.17916 0.18337 0.15024 0.22308 0.15325 0.14612
    ˆτ 0.02281 0.02434 0.02847 0.02665 0.02332 0.06418 0.02393 0.02676
    MSE ˆζ 0.01841 0.02133 0.49097 0.03316 0.02224 0.04908 0.02315 0.02102
    ˆτ 0.00031 0.00064 0.01058 0.00075 0.00042 0.00407 0.00053 0.00096
    MRE ˆζ 0.09121 0.09803 0.11946 0.12257 0.10014 0.14878 0.10205 0.09742
    ˆτ 0.04571 0.04874 0.05697 0.05315 0.04672 0.12838 0.04793 0.05346
    Sum of ranks 61 213 417 356 182 478 244.5 244.5

     | Show Table
    DownLoad: CSV
    Table 8.  Ranking of the estimation methods based on the simulation results.
    n ML WLS PC RAD MPS AD CVM OLS
    Scheme Ⅰ 20 1 3 6 7.5 5 7.5 2 4
    50 1 2 6 7 4 8 3 5
    150 1 2.5 7 5 4 8 2.5 6
    300 1 2 6 7 4 8 3 5
    Scheme Ⅱ 20 1.5 3.5 7 6 3.5 8 1.5 5
    50 1 4 6 6 2 8 3 6
    150 1 2 5 6.5 4 8 3 6.5
    300 1 3 6 6 4 8 2 6
    Scheme Ⅲ 20 2 3.5 6 7 3.5 8 1 5
    50 1 2.5 6 6 4 8 2.5 6
    150 1 2 6 6 3 8 4 6
    300 1 2 5 7 5 8 4 6
    Scheme Ⅳ 20 1 4 6 7 2 8 13 5
    50 1 3 6.5 6.5 4 8 2 5
    150 1 2 7 6 4.5 8 4.5 3
    300 1 3 7 6 2 8 4.5 4.5
    Sum of ranks 17.5 44 98.5 102.5 58.5 127.5 45.5 84
    Mean of ranks 1.09 2.75 6.16 6.41 3.66 7.97 2.84 5.25
    SD of ranks 0.27 0.73 0.63 0.64 0.96 0.13 1.03 0.91
    Overall rank 1 2 6 7 4 8 3 5

     | Show Table
    DownLoad: CSV
    Figure 4.  The bias of the estimation methods for Scheme Ⅰ.
    Figure 5.  The MSE of the estimation methods for Scheme Ⅰ.
    Figure 6.  The MRE of the estimation methods for Scheme Ⅰ.
    Figure 7.  The mean and standard deviation of the ranks for the different estimation methods.

    From Tables 4 to 7, it is clear that as the sample size n increases, the bias of the parameters ζ and τ decreases, approaching zero. Similarly, both the MSE and MRE of the DExg parameters improve as the sample size grows, indicating the reliability of the estimators. Additionally, all estimation methods perform well across various sample sizes. As shown in Table 8 and Figure 7, the MLE method stands out as the top performer, achieving the lowest rank across the sum of ranks, mean of ranks, standard deviation of ranks, and overall rank. This highlights MLE as the most effective method among the ones tested.

    Within this section, we demonstrate the flexibility of the DExg distribution in effectively modeling diverse datasets across different fields. The fitted distributions are evaluated by employing the Chi-square (χ2) test along with its associated P–value. The distribution that provided the best fit was chosen on the basis of the lowest χ2 statistic and the highest P-value. A low P–value (typically below 0.05) indicates a poor fit, while a high P–value suggests a good fit between the observed data and the fitted distribution. The fit of the DExg distribution is compared with those of other competing distributions, including the discrete inverted Nadarajah–Haghighi (DINH), discrete inverse Weibull (DIW), discrete Burr Type Ⅱ (DBX–II), geometric (Geo), Poisson (Poi), discrete Pareto (DPa), discrete Rayleigh (DR), and discrete inverse Rayleigh (DIR).

    Radiation biology data, including studies on agents like streptonigrin, examines the effects of radiation and radiation-like substances on living organisms. Streptonigrin, a genotoxic agent, induces DNA damage and cytogenetic lesions, making it a valuable tool for studying cellular responses to stress. These data are essential for understanding mechanisms like DNA repair, apoptosis, and mutagenesis caused by such agents. It also aids in developing targeted cancer therapies and radioprotective agents to minimize damage to healthy tissues. By integrating biological research with biotechnology, the study of substances like streptonigrin enhances our understanding of genotoxicity and its applications in medicine and environmental safety. In this section, the dataset, referred to as "Dataset Ⅰ", pertains to mammalian cytogenetic dosimetry, specifically examining lesions in rabbit lymphoblasts induced by the genotoxic agent streptonigrin (NSC-45383). The exposure level for the study was set at 60 μ g/kg. This dataset provides insights into the cytogenetic effects of streptonigrin, offering a basis for understanding its impact on cellular structures and functions. For a detailed description of the experimental setup and findings, refer to the study by Shanker and Fesshaye [24]. Figure 8 illustrates the distribution of the data, which exhibits a right-skewed asymmetry. This skewness is attributed to the presence of several extreme observations that influence the overall distribution. Additionally, the data shows a unimodal shape, characterized by a single peak. The distribution is platykurtic, meaning it has a flatter peak and lighter tails compared with a normal distribution. Furthermore, there is evidence of overdispersion, indicating that the variance exceeds the mean, suggesting that the data's variability is higher than expected under a standard model.

    Figure 8.  Non–parametric plots for Dataset Ⅰ.

    Table 9 presents the results of the goodness-of-fit test for Dataset Ⅰ, evaluating the performance of several statistical models applied to the data. The models considered include the DExg, DB-XII, and Geo models. Figure 9 visually supports these empirical results, highlighting how each model fits the observed data. The figure compares the predictions from each model with the actual data, providing a clear view of the discrepancies and overall fit. While all three models are capable of modeling the data, the DExg model stands out as the most effective, offering the best fit and demonstrating superior accuracy in capturing the distribution of the data. Furthermore, Figure 10 proves the uniqueness property of the model estimators, confirming that the parameter estimates derived from the DExg model are stable and uniquely determined for the given dataset. This figure further reinforces the reliability of the DExg model, making it the most suitable choice for this analysis. Together, Table 9, Figure 9, and Figure 10 provide a thorough assessment of the model's performance, with the DExg model being identified as the optimal choice for Dataset Ⅰ.

    Table 9.  The goodness–of–fit test for Dataset Ⅰ.
    X OF DExg DINH DIW DB-XII Geo Poi DPa DR DIR
    0 413 413.29 414.58 411.60 412.71 407.68 374.05 447.06 273.43 411.86
    1 124 123.96 91.62 136.59 132.73 131.14 177.38 84.54 274.53 134.96
    2 42 41.61 31.37 29.13 33.25 42.18 42.06 29.96 50.49 29.47
    3 15 14.52 15.79 10.39 11.29 13.57 6.65 13.99 2.515 10.68
    4 5 5.05 9.50 4.82 4.79 4.36 0.79 7.66 0.04 5.01
    5 0 1.72 6.34 2.61 2.36 1.40 0.07 4.64 0.00 2.74
    6 2 0.84 31.80 5.86 3.87 0.67 0.00 13.15 0.00 6.28
    Total 601 601 601 601 601 601 601 601 601
    l 556.197 595.708 564.056 560.131 556.516 582.677 580.141 700.860 564.095
    MLEζ 0.198 23.1217 0.684 0.187 0.322 0.474 0.140 0.545 0.685
    MLEτ 1.363 0.0137 2.041 1.653
    χ2 0.069 51.489 12.234 5.563 0.660 58.562 39.711 156.05 11.861
    d.f 2 4 3 2 3 3 4 1 4
    P–value 0.966 <0.001 0.007 0.062 0.883 <0.001 <0.001 <0.001 0.018

     | Show Table
    DownLoad: CSV
    Figure 9.  The estimated PMF based on Dataset Ⅰ.
    Figure 10.  The log-likelihood profiles based on Dataset Ⅰ.

    Table 10 displays the estimation techniques applied to Dataset Ⅰ, highlighting various methods used to estimate the model parameters. It is evident that all of the methods can be applied to the data; however, the MLE stands out as the most effective approach.

    Table 10.  Various estimators for Dataset Ⅰ.
    ML MPS OLS WLS CVM PC AD RAD
    ˆζ 0.198 0.192 0.195 0.196 0.194 0.186 0.187 0.189
    ˆτ 1.363 1.371 1.331 1.358 1.371 1.412 1.284 1.295
    χ2 0.069 0.134 0.484 0.078 0.117 1.545 2.319 1.764
    d.f 2 2 2 2 2 2 2 2
    P.value 0.966 0.935 0.785 0.962 0.943 0.462 0.314 0.414

     | Show Table
    DownLoad: CSV

    Industrial engineering focuses on optimizing complex systems and processes to improve efficiency and productivity across various industries. It involves analyzing workflows, reducing waste, and ensuring that resources are utilized in the most effective way possible. One key aspect of industrial engineering is the management of machines' performance, where engineers monitor and address issues such as computer breakdowns occurring over a period. These breakdowns can disrupt production schedules and lead to downtime, which significantly impacts overall productivity. Industrial engineers use predictive maintenance strategies and data analysis to anticipate equipment failures, minimizing downtime and improving the reliability of manufacturing systems. This section presents a dataset that records the number of computer breakdowns occurring over a span of 128 consecutive weeks of operation (see Hand et al. [25]). Figure 11 illustrates the distribution of Dataset Ⅱ. The results reveal a moderate rightward skew, likely due to the presence of some extreme observations, along with a relatively high level of variability. The kurtosis indicates that the distribution has slightly heavier tails than a normal distribution, though not excessively so, and shows evidence of overdispersion.

    Figure 11.  Non–parametric plots for Dataset Ⅱ.

    Table 11 presents the results of the goodness–of–fit test for Dataset Ⅱ, assessing the performance of various statistical models applied to the data, including the DExg and Geo models. Figure 12 visually complements these results, illustrating how each model fits the observed data by comparing their predictions to the actual data and highlighting any discrepancies and the overall fit. While all three models are capable of modeling the data, the DExg model stands out as the most effective, providing the best fit and demonstrating superior accuracy in capturing the data's distribution. Additionally, Figure 13 confirms the uniqueness property of the model estimators, showing that the parameter estimates derived from the DExg model are stable and uniquely determined for the given dataset. This further emphasizes the reliability of the DExg model, establishing it as the most suitable choice for this analysis. Together, Table 11, Figure 9, and Figure 13 offer a comprehensive evaluation of the models' performance, with the DExg model identified as the optimal choice for Dataset Ⅱ.

    Table 11.  The goodness–of–fit test for Dataset Ⅱ.
    X OF DExg DINH DIW DB-XII Geo Poi DPa DR DIR
    0 15 19.04 12.40 9.69 19.89 25.53 2.31 47.81 3.64 3.24
    1 19 16.47 30.16 33.09 36.59 20.43 9.27 19.19 10.30 47.81
    2 23 16.06 19.94 23.10 19.05 16.36 18.61 10.76 15.31 34.02
    3 14 15.23 12.78 14.48 10.69 13.09 24.91 7.02 18.04 16.65
    4 15 13.61 8.72 9.54 6.83 10.49 25.01 5.00 18.44 8.77
    5 10 11.52 6.29 6.64 4.76 8.39 20.09 3.77 16.92 5.08
    6 8 9.33 4.73 4.84 3.52 6.72 13.44 2.97 14.17 3.17
    7 4 7.28 3.69 3.65 2.72 5.38 7.71 2.40 10.94 2.11
    8 6 5.52 2.95 2.85 2.17 4.31 3.87 1.99 7.84 1.47
    9 2 4.09 2.41 2.27 1.77 3.46 1.74 1.69 5.23 1.06
    10 3 2.97 2.01 1.85 1.48 2.75 0.69 1.45 3.26 0.79
    11 3 2.12 1.69 1.53 1.25 2.21 0.25 1.26 1.89 0.60
    12 2 1.49 1.46 1.28 1.08 1.77 0.08 1.11 1.04 0.47
    +13 4 3.27 18.758 13.183 16.188 7.114 0.006 21.586 0.985 2.762
    Total 128 128 128 128 128 128 128 128 128 128
    l 318.953 331.931 330.446 342.581 320.703 384.974 369.766 347.148 356.525
    MLEζ 0.497 1.244 0.076 0.785 0.801 4.016 0.509 0.972 0.025
    MLEτ 0.513 1.634 1.235 3.309
    χ2 7.093 20.199 18.709 40.183 11.651 88.995 94.971 49.897 50.533
    d.f 8 6 6 5 9 6 6 8 5
    P–value 0.527 0.003 0.005 0 0.234 0 0 0 0

     | Show Table
    DownLoad: CSV
    Figure 12.  The estimated PMF for Dataset Ⅱ.
    Figure 13.  The log–likelihood profiles based on Dataset Ⅱ.

    Table 12 presents the estimation techniques applied to Dataset Ⅱ, showcasing the different methods used to estimate the model parameters. While all methods are applicable to the data, the MLE emerges as the most effective approach.

    Table 12.  Numerical descriptors for characterizing the DExg distribution.
    ML MPS OLS WLS CVM PC AD RAD
    ˆζ 0.497 0.474 0.513 0.483 0.495 0.461 0.501 0.526
    ˆτ 0.513 0.522 0.532 0.512 0.522 0.527 0.582 0.565
    χ2 7.093 7.831 7.932 7.683 7.711 8.153 10.766 11.0434
    d.f 8 8 8 8 8 8 7 8
    P.value 0.527 0.450 0.440 0.465 0.462 0.419 0.149 0.199

     | Show Table
    DownLoad: CSV

    The DExg distribution proves to be a highly suitable and effective model for both radiation biology and industrial engineering applications due to its flexibility and ability to accommodate complex features in data.

    For the radiation biology dataset, the DExg distribution excels in modeling data with skewness, overdispersion, zero–inflation, and heavy tails, which are common in radiation studies. These characteristics make it more adaptable to the irregularities found in radiation exposure data, where the majority of observations might cluster around a certain value with occasional extreme readings. The ability to handle overdispersion, where the variance exceeds the mean, and to effectively model rare, extreme events (heavy tails) provides a better fit compared with simpler models such as the normal or exponential distributions. As a result, the DExg distribution allows for a more accurate representation of biological processes and better risk assessments regarding radiation exposure.

    In industrial engineering, particularly in the prediction and prevention of machine breakdowns, the DExg distribution's ability to model failure times under variable conditions is crucial. Machines typically follow a lifetime distribution that is skewed, with a few units experiencing early failures, while the majority last longer, making it challenging to predict breakdowns with simple models. The DExg distribution's flexibility to model skewed data and handle overdispersion makes it ideal for modeling the time-to-failure data of machines in industrial settings. By effectively capturing the nuances of machines' breakdown behavior, the DExg distribution enables predictive maintenance, reducing downtime and maintenance costs while improving the reliability and efficiency of industrial systems.

    In both fields, the DExg distribution not only enhances predictive accuracy but also provides valuable insights for risk management and decision-making. Its application helps in optimizing resources, scheduling maintenance, and improving systems' reliability, ensuring that both radiation biology studies and industrial operations achieve better outcomes through the application of advanced statistical modeling techniques. Thus, the DExg distribution stands as a powerful tool in both the biological and industrial domains, offering practical solutions to complex data modeling challenges.

    In this paper, the DExg model was introduced, with several key objectives aimed at addressing practical challenges in count data analysis. The model offers a discrete distribution with simple and closed-form solutions for fundamental properties, such as the probability mass function, cumulative distribution function, moments, and others. It was designed to be highly applicable to real-world datasets, including those from industrial and mortality studies, which exhibit bathtub–, or increasing failure rates. Moreover, the DExg model was crafted to accommodate a wide range of data types, such as equi–, over–, and under–dispersed datasets, as well as those with positive skewness or leptokurtic distributions. Various estimation strategies were explored, and through simulations and a ranking methodology, the maximum likelihood estimation method was identified as the most effective. Two distinct datasets from biotechnology and industrial engineering were analyzed, with the proposed model demonstrating superior fit compared with alternative models. Notably, the DExg model outperformed several well-established discrete models in the current statistical literature, further underscoring its robustness and applicability. Future work will focus on extending the model for forecasting and bivariate analysis, as well as performing a sensitivity analysis to assess the impact of model misspecification.

    Conceptualization, Mohamed S. Eliwa; data curation, Mahmoud El-Morshedy and Mohamed S. Algolam; formal analysis, Mohamed S. Eliwa, Mohamed S. Algolam and Mohamed El-Dawoody; investigation, Mahmoud El-Morshedy and Mohamed S. Algolam; methodology, Mahmoud El-Morshedy and Mohamed S. Eliwa; resources, Mohamed S. Algolam and Mohamed El-Dawoody; software, Mahmoud El-Morshedy and Mohamed S. Eliwa; validation, Mohamed S. Eliwa and Mohamed El-Dawoody; writing–review and editing, Mahmoud El-Morshedy and Mohamed S. Eliwa. All authors have read and agreed to the published version of the manuscript.

    The authors declare they have not used AI tools in the creation of this article.

    The authors extend their appreciation to Prince Sattam bin Abdulaziz University for funding this research work through the project number (PSAU/2024/01/31935).

    The authors declare that they have no conflicts of interest.



    [1] J. Melnikow, A. Padovani, M. Miller, Frontline physician burnout during the COVID-19 pandemic: national survey findings, BMC Health Serv. Res., 22 (2022), 365. https://doi.org/10.1186/s12913-022-07728-6 doi: 10.1186/s12913-022-07728-6
    [2] J. Mapes, Using big data to study small places: small-town voting patterns in the 2020 US presidential election, Growth Change, 55 (2024), e12730. https://doi.org/10.1111/grow.12730 doi: 10.1111/grow.12730
    [3] E. Clough, T. Barrett, S. E. Wilhite, P. Ledoux, C. Evangelista, I. F. Kim, et al., NCBI GEO: archive for gene expression and epigenomics data sets: 23-year update, Nucleic Acids Res., 52 (2024), D138–D144. https://doi.org/10.1093/nar/gkad965 doi: 10.1093/nar/gkad965
    [4] S. Yang, D. Meng, A. Díaz, H. Yang, X. Su, A. M. D. Jesus, Probabilistic modeling of uncertainties in reliability analysis of mid-and high-strength steel pipelines under hydrogen-induced damage, Int. J. Struct. Integr., 16 (2025), 39–59. https://doi.org/10.1108/IJSI-10-2024-0177 doi: 10.1108/IJSI-10-2024-0177
    [5] S. Yang, D. Meng, H. Yang, C. Luo, X. Su, Enhanced soft Monte Carlo simulation coupled with support vector regression for structural reliability analysis, In: Proceedings of the Institution of Civil Engineers-Transport, Emerald Publishing Limited, 2024, 1–16. https://doi.org/10.1680/jtran.24.00128
    [6] D. Meng, H. Yang, S. Yang, Y. Zhang, A. M. De Jesus, J. Correia, et al., Kriging-assisted hybrid reliability design and optimization of offshore wind turbine support structure based on a portfolio allocation strategy, Ocean Eng., 295 (2024), 116842. https://doi.org/10.1016/j.oceaneng.2024.116842 doi: 10.1016/j.oceaneng.2024.116842
    [7] S. Sen, S. K. Ghosh, H. Al-Mofleh, The Mirra distribution for modeling time-to-event data sets, In: Dtrategic management, decision theory, and decision science: contributions to policy issues, Singapore: Springer, 2021. https://doi.org/10.1007/978-981-16-1368-5_5
    [8] D. Roy, Discrete Rayleigh distribution, IEEE Trans. Reliab., 53 (2004), 255–260. https://doi.org/10.1109/TR.2004.829161 doi: 10.1109/TR.2004.829161
    [9] M. El-Morshedy, M. S. Eliwa, E. Altun, Discrete Burr-Hatke distribution with properties, estimation methods and regression model, IEEE Access, 8 (2020), 74359–74370. https://doi.org/10.1109/ACCESS.2020.2988431 doi: 10.1109/ACCESS.2020.2988431
    [10] H. Krishna, P. S. Pundir, Discrete Burr and discrete Pareto distributions, Stat. Methodol., 6 (2009), 177–188. https://doi.org/10.1016/j.stamet.2008.07.001 doi: 10.1016/j.stamet.2008.07.001
    [11] T. Hussain, M. Ahmad, Discrete inverse Rayleigh distribution, Pakistan J. Stat., 30 (2014).
    [12] A. E. Abd EL-Hady, M. A. Hegazy, A. A. EL-Helbawy, A discrete exponentiated generalized family of distributions, Comput. J. Math. Stat. Sci., 2 (2023), 303–327.
    [13] A. R. E. Alosey, A. M. Gemeay, A novel version of geometric distribution: method and application, Comput. J. Math. Stat. Scie., 4 (2025), 1–16.
    [14] M. A. Jazi, C. D. Lai, M. H. Alamatsaz, A discrete inverse Weibull distribution and estimation of its parameters, Stat. Methodol., 7 (2010), 121–132. https://doi.org/10.1016/j.stamet.2009.11.001 doi: 10.1016/j.stamet.2009.11.001
    [15] E. Gómez-Déniz, E. Calderín-Ojeda, The discrete Lindley distribution: properties and applications, J. Stat. Comput. Simul., 81 (2011), 1405–1416. https://doi.org/10.1080/00949655.2010.487825 doi: 10.1080/00949655.2010.487825
    [16] J. M. Jia, Z. Z. Yan, X. Y. Peng, A new discrete extended Weibull distribution, IEEE Access, 7 (2019), 175474–175486. https://doi.org/10.1109/ACCESS.2019.2957788 doi: 10.1109/ACCESS.2019.2957788
    [17] E. Altun, A new generalization of geometric distribution with properties and applications, Commun. Stat.-Simul. Comput., 49 (2020), 793–807. https://doi.org/10.1080/03610918.2019.1639739 doi: 10.1080/03610918.2019.1639739
    [18] E. Gómez-Déniz, Another generalization of the geometric distribution, Test, 19 (2010), 399–415. https://doi.org/10.1007/s11749-009-0169-3 doi: 10.1007/s11749-009-0169-3
    [19] M. A. Hegazy, R. E. Abd El-Kader, A. A. El-Helbawy, G. R. Al-Dayian, Bayesian estimation and prediction of discrete Gompertz distribution, J. Adv. Math. Comput. Sci., 36 (2021), 1–21.
    [20] V. Nekoukhou, M. H. Alamatsaz, H. Bidram, Discrete generalized exponential distribution of a second type, Statistics, 47 (2013), 876–887. https://doi.org/10.1080/02331888.2011.633707 doi: 10.1080/02331888.2011.633707
    [21] A. S. Eldeeb, M. Ahsan-ul-Haq, M. S. Eliwa, A discrete Ramos-Louzada distribution for asymmetric and over-dispersed data with leptokurtic-shaped: properties and various estimation techniques with inference, AIMS Math., 7 (2022), 1726–1741. https://doi.org/10.3934/math.2022099 doi: 10.3934/math.2022099
    [22] T. Hussain, M. Aslam, M. Ahmad, A two parameter discrete Lindley distribution, Rev. Colomb. Estad., 39 (2016), 45–61.
    [23] B. C. Arnold, N. Balakrishnan, H. N. Nagaraja, A first course in order statistics, Society for Industrial and Applied Mathematics, 2008.
    [24] R. Shanker, H. Fesshaye, On Poisson-Sujatha distribution and its applications to model count data from biological sciences, Biometrics Biostat. Int. J., 3 (2016), 1–7. https://doi.org/10.15406/bbij.2015.02.00036 doi: 10.15406/bbij.2015.02.00036
    [25] D. J. Hand, F. Daly, K. McConway, D. Lunn, E. Ostrowski, A handbook of small data sets, Chapman & Hall, 1993. https://doi.org/10.1201/9780429246579
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(362) PDF downloads(38) Cited by(0)

Figures and Tables

Figures(13)  /  Tables(12)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog