This study introduced a two-parameter zero-inflated discrete random variable distribution designed to model failure profiles in zero-inflated, dispersed datasets, commonly found in biological engineering and reliability analysis. The proposed distribution combined traditional count models, such as Poisson, Lindley, or negative binomial, with a probability mass at zero, providing a robust framework for addressing excess zeros and the underlying dispersion of data. The mathematical foundation of the distribution was derived with an emphasis on its statistical and reliability properties. The probability mass function was applicable to datasets with asymmetric dispersion and varying kurtosis structures. In addition, the hazard rate function was used to analyze failure rate behaviors, capturing patterns such as increasing, decreasing, and bathtub-shaped failure rates, often encountered in real-world datasets. Also, characterization of the proposed distribution was explored based on conditional expectation and the hazard rate function. Parameter estimation techniques were proposed, alongside computational simulations, to identify the most consistent estimators for data modeling. The goodness of fit of the proposed model was rigorously evaluated by comparing it with existing count models, demonstrating its superior ability to model zero-inflated, overdispersed data. Finally, the practical application of the new distribution was demonstrated using real-life biological engineering datasets, highlighting its effectiveness and flexibility in modeling complex zero-inflated data across various failure profiles and reliability contexts.
Citation: Wael W. Mohammed, Kalpasree Sharma, Partha Jyoti Hazarika, G. G. Hamedani, Mohamed S. Eliwa, Mahmoud El-Morshedy. Zero-inflated discrete Lindley distribution: Statistical and reliability properties, estimation techniques, and goodness-of-fit analysis[J]. AIMS Mathematics, 2025, 10(5): 11382-11410. doi: 10.3934/math.2025518
This study introduced a two-parameter zero-inflated discrete random variable distribution designed to model failure profiles in zero-inflated, dispersed datasets, commonly found in biological engineering and reliability analysis. The proposed distribution combined traditional count models, such as Poisson, Lindley, or negative binomial, with a probability mass at zero, providing a robust framework for addressing excess zeros and the underlying dispersion of data. The mathematical foundation of the distribution was derived with an emphasis on its statistical and reliability properties. The probability mass function was applicable to datasets with asymmetric dispersion and varying kurtosis structures. In addition, the hazard rate function was used to analyze failure rate behaviors, capturing patterns such as increasing, decreasing, and bathtub-shaped failure rates, often encountered in real-world datasets. Also, characterization of the proposed distribution was explored based on conditional expectation and the hazard rate function. Parameter estimation techniques were proposed, alongside computational simulations, to identify the most consistent estimators for data modeling. The goodness of fit of the proposed model was rigorously evaluated by comparing it with existing count models, demonstrating its superior ability to model zero-inflated, overdispersed data. Finally, the practical application of the new distribution was demonstrated using real-life biological engineering datasets, highlighting its effectiveness and flexibility in modeling complex zero-inflated data across various failure profiles and reliability contexts.
| [1] |
J. Neyman, On a new class of "contagious" distributions, applicable in entomology and bacteriology, Ann. Math. Statist., 10 (1939), 35–57. https://doi.org/10.1214/aoms/1177732245 doi: 10.1214/aoms/1177732245
|
| [2] | W. Feller, On a general class of "contagious" distributions, Ann. Math. Statist., 14 (1943), 389–400. |
| [3] | A. C. Cohen, Estimating the parameters of a modified poisson distribution, J. Amer. Statist. Assoc., 55 (1960), 139–143. |
| [4] | K. Yoneda, Estimations in some modified poisson distributions, Yokohama Math. J., 10 (1962), 73–96. |
| [5] | D. Lambert, Zero-inflated poisson regression, with an application to defects in manufacturing, Technometrics, 34 (1992), 1–14. |
| [6] |
P. L. Gupta, R. C. Gupta, R. C. Tripathi, Inflated modified power series distributions with applications, Comm. Statist. Theory Methods, 24 (1995), 2355–2374. https://doi.org/10.1080/03610929508831621 doi: 10.1080/03610929508831621
|
| [7] |
T. H. Lin, M. H. Tsai, Modeling health survey data with excessive zero and k responses, Stat. Med., 32 (2013), 1572–1583. https://doi.org/10.1002/sim.5650 doi: 10.1002/sim.5650
|
| [8] |
M. Arora, N. R. Chaganty, EM estimation for zero-and k-inflated poisson regression model, Computation, 9 (2021), 94. https://doi.org/10.3390/computation9090094 doi: 10.3390/computation9090094
|
| [9] |
Y. Sun, S. Zhao, G. L. Tian, M. L. Tang, T. Li, Likelihood-based methods for the zero-one-two inflated poisson model with applications to biomedicine, J. Stat. Comput. Simul., 93 (2023), 956–982. https://doi.org/10.1080/00949655.2021.1970162 doi: 10.1080/00949655.2021.1970162
|
| [10] |
M. Melkersson, D. O. Rooth, Modeling female fertility using inflated count data models, J. Popul. Econ., 13 (2000), 189–203. https://doi.org/10.1007/s001480050133 doi: 10.1007/s001480050133
|
| [11] | M. Begum, A. Mallick, N. Pal, A generalized inflated poisson distribution with application to modeling fertility data, Thail. Stat., 12 (2014), 135–159. |
| [12] |
H. Saboori, M. Doostparast, Zero to $k$ inflated poisson regression models with applications, J. Stat. Theory Appl., 22 (2023), 366–392. https://doi.org/10.1007/s44199-023-00067-3 doi: 10.1007/s44199-023-00067-3
|
| [13] |
F. Famoye, K. P. Singh, Zero-inflated generalized poisson regression model with an application to domestic violence data, J. Data Sci., 4 (2006), 117–130. https://doi.org/10.6339/JDS.2006.04(1).257 doi: 10.6339/JDS.2006.04(1).257
|
| [14] |
D. B. Hall, Zero-inflated poisson and binomial regression with random effects: A case study, Biometrics, 56 (2000), 1030–1039. https://doi.org/10.1111/j.0006-341X.2000.01030.x doi: 10.1111/j.0006-341X.2000.01030.x
|
| [15] |
T. Rahman, P. J. Hazarika, M. P. Barman, One inflated binomial distribution and its real-life applications, Indian J. Sci. Tech., 14 (2021), 1839–1854. http://dx.doi.org/10.17485/IJST/v14i22.404 doi: 10.17485/IJST/v14i22.404
|
| [16] | B. P. Singh, S. Maheshwari, P. K. Gupta, A probability model for sex composition of children in the presence of son preference, Demography India, 44 (2015), 50–57. |
| [17] | G. Nanjundan, S. Pasha, A characterization of zero-inflated binomial model, Int. J. Math. Comp. Res., 3 (2015), 1188–1190. |
| [18] |
D. C. Heilbron, Zero-altered and other regression models for count data with added zeros, Biom. J., 36 (1994), 531–547. https://doi.org/10.1002/bimj.4710360505 doi: 10.1002/bimj.4710360505
|
| [19] |
A. M. Garay, E. M. Hashimoto, E. M. M. Ortega, V. H. Lachos, On estimation and influence diagnostics for zero-inflated negative binomial regression models, Comput. Statist. Data Anal., 55 (2011), 1304–1318. https://doi.org/10.1016/j.csda.2010.09.019 doi: 10.1016/j.csda.2010.09.019
|
| [20] |
R. Suresh, G. Nanjundan, S. Nagesh, S. Pasha, On a characterization of zero-inflated negative binomial distribution, Open J. Stat., 5 (2015), 511–513. http://dx.doi.org/10.4236/ojs.2015.56053 doi: 10.4236/ojs.2015.56053
|
| [21] | R. S. A. Alshkaki, Estimation of the parameters of the zero-one inflated power series distribution, Bull. Math. Stat. Res., 4 (2016), 45–54. |
| [22] | I. J. A. Serra, D. L. L. Polestico, On the zero and k-inflated negative binomial distribution with applications, Adv. Appl. Stat., 88 (2023), 1–23. |
| [23] | N. L. Johnson, A. W. Kemp, S. Kotz, Univariate discrete distributions, John Wiley & Sons, Inc., 2005. http://dx.doi.org/10.1002/0471715816 |
| [24] | C. C. Iwunor, Estimation of parameters of the inflated geometric distribution for rural out-migration, Genus, 51 (1995), 253–260. |
| [25] |
G. D. C. Barriga, F. Louzada, The zero-inflated conway–maxwell–poisson distribution: Bayesian inference, regression modeling and influence diagnostic, Stat. Methodol., 21 (2014), 23–34. https://doi.org/10.1016/j.stamet.2013.11.003 doi: 10.1016/j.stamet.2013.11.003
|
| [26] |
A. J. Lemonte, G. Moreno-Arenas, F. Castellares, Zero-inflated bell regression models for count data, J. Appl. Stat., 47 (2019), 265–286. https://doi.org/10.1080/02664763.2019.1636940 doi: 10.1080/02664763.2019.1636940
|
| [27] |
L. Rivas, F. Campos, Zero inflated waring distribution, Comm. Statist. Simulation Comput., 52 (2023), 3676–3691. https://doi.org/10.1080/03610918.2021.1944638 doi: 10.1080/03610918.2021.1944638
|
| [28] | R. Ospina, S. L. Ferrari, Inflated beta distributions, Stat. papers, 51 (2010), 111–126. https://doi.org/10.1007/s00362-008-0125-4 |
| [29] |
A. B. Ferreira, J. Mazucheli, The zero-inflated, one and zero-and-one-inflated new unit-Lindley distributions, Braz. J. Biom., 40 (2022), 291–396. https://doi.org/10.28951/bjb.v40i3.571 doi: 10.28951/bjb.v40i3.571
|
| [30] | H. S. Bakouch, M. A. Jazi, S. Nadarajah, A new discrete distribution, Statistics, 48 (2014), 200–240. https://doi.org/10.1080/02331888.2012.716677 |
| [31] | D. V. Lindley, Fiducial distributions and bayes' theorem, J. R. Stat. Soc. Ser. B Stat. Methodol., 20 (1958), 102–107. |
| [32] | A. Nandi, S. Chakraborty, A. Biswas, A new over-dispersed count model based on poisson-geometric convolution, Comm. Statist. Simulation Comput., 2024, 1–28. https://doi.org/10.1080/03610918.2024.2329997 |
| [33] | L. Tomy, G. Veena, C. Chesneau, Exponentiated discrete modified Lindley distribution and its applications in the healthcare sector, In: G Families of probability distributions, CRC Press, 2023,279–287. |
| [34] | C. Chesneau, L. Tomy, G. Veena, The poisson-modified lindley distribution, Appl. Math. E-Notes, 22 (2022), 18–31. |
| [35] |
M. El-Morshedy, E. Altun, M. S. Eliwa, A new statistical approach to model the counts of novel coronavirus cases, Math. Sci., 16 (2022), 37–50. https://doi.org/10.1007/s40096-021-00390-9 doi: 10.1007/s40096-021-00390-9
|
| [36] | A. Zeileis, C. Kleiber, countreg: Count data regression, R package version 0.2-1. Aviliable from: https://rdrr.io/rforge/countreg/ |
| [37] |
R. Shanker, H. Fesshaye, S. Selvaraj, A. Yemane, On zero-truncation of poisson and poisson-lindley distributions and their applications, Biom. Biostat. Int. J., 2 (2015), 168–181. https://doi.org/10.15406/bbij.2015.02.00045 doi: 10.15406/bbij.2015.02.00045
|