Research article Special Issues

A new extended distribution with monotonic and nonmonotonic failure rates: statistical properties and comparative predictive modeling in medical datasets

  • Published: 27 May 2026
  • MSC : 62F10, 60E05, 62H12

  • This study introduces a new continuous probability distribution, termed the extended bounded sine hyperbolic (EBSH) distribution, for modeling non-negative data with flexible structural properties. The distribution accommodates symmetric and skewed behaviors and captures a wide range of hazard rate patterns, including increasing, decreasing, and bathtub-shaped forms. Beyond its theoretical contribution, the study investigates the use of the EBSH distribution as a feature engineering mechanism in machine learning. Raw input variables are transformed through the EBSH formulation to enhance data representation and improve predictive performance. The approach is evaluated using COVID-19 mortality and breast cancer datasets, using models such as recurrent neural networks (RNN) and support vector regression (SVR). Experimental results indicate that EBSH-based feature engineering improves prediction accuracy compared to raw features. For the COVID-19 dataset, the RNN model achieved a mean absolute error (MAE) of 0.0296 and a root mean square error (RMSE) of 0.0373 using raw features, which further reduced to approximately 0.0251 (MAE) and 0.0328 (RMSE) after applying EBSH-based transformations. Similarly, for the breast cancer dataset, SVR produced an MAE of 4.0177 and an RMSE of 4.9926 with raw features, improving to about 3.6842 and 4.5213, respectively, under the engineered feature space. These findings demonstrate that the proposed EBSH distribution not only provides a flexible statistical modeling approach but also serves as an effective feature engineering tool that enhances machine learning performance across real-world datasets.

    Citation: Abdulrahman M. A. Aldawsari, Zahrah Fayez Althobaiti, Aminu Suleiman Mohammed, Abdulmajeed A. R. Alharbi. A new extended distribution with monotonic and nonmonotonic failure rates: statistical properties and comparative predictive modeling in medical datasets[J]. AIMS Mathematics, 2026, 11(5): 14870-14914. doi: 10.3934/math.2026613

    Related Papers:

  • This study introduces a new continuous probability distribution, termed the extended bounded sine hyperbolic (EBSH) distribution, for modeling non-negative data with flexible structural properties. The distribution accommodates symmetric and skewed behaviors and captures a wide range of hazard rate patterns, including increasing, decreasing, and bathtub-shaped forms. Beyond its theoretical contribution, the study investigates the use of the EBSH distribution as a feature engineering mechanism in machine learning. Raw input variables are transformed through the EBSH formulation to enhance data representation and improve predictive performance. The approach is evaluated using COVID-19 mortality and breast cancer datasets, using models such as recurrent neural networks (RNN) and support vector regression (SVR). Experimental results indicate that EBSH-based feature engineering improves prediction accuracy compared to raw features. For the COVID-19 dataset, the RNN model achieved a mean absolute error (MAE) of 0.0296 and a root mean square error (RMSE) of 0.0373 using raw features, which further reduced to approximately 0.0251 (MAE) and 0.0328 (RMSE) after applying EBSH-based transformations. Similarly, for the breast cancer dataset, SVR produced an MAE of 4.0177 and an RMSE of 4.9926 with raw features, improving to about 3.6842 and 4.5213, respectively, under the engineered feature space. These findings demonstrate that the proposed EBSH distribution not only provides a flexible statistical modeling approach but also serves as an effective feature engineering tool that enhances machine learning performance across real-world datasets.



    加载中


    [1] H. J. Wang, J. R. Zhang, B. Li, F. Z. Xuan, Machine learning-based fatigue life prediction of laser powder bed fusion additively manufactured Hastelloy X via nondestructively detected defects, Int. J. Struct. Integr., 16 (2025), 104–126. https://doi.org/10.1108/IJSI-09-2024-0161 doi: 10.1108/IJSI-09-2024-0161
    [2] L. W. Dang, X. F. He, D. C. Tang, H. Xin, B. Wu, A fatigue life prediction framework of laser-directed energy deposition Ti-6Al-4V based on physics-informed neural network, Int. J. Struct. Integr., 16 (2025), 327–354. https://doi.org/10.1108/IJSI-10-2024-0170 doi: 10.1108/IJSI-10-2024-0170
    [3] S. Y. Yang, D. B. Meng, H. T. Wang, C. Yang, A novel learning function for adaptive surrogate-model-based reliability evaluation, Philos. Trans. A Math. Phys. Eng. Sci., 382 (2024), 20220395. https://doi.org/10.1098/rsta.2022.0395 doi: 10.1098/rsta.2022.0395
    [4] H. Daud, A. A. Suleiman, A. I. Ishaq, N. Alsadat, M. Elgarhy, A. Usman, et al., A new extension of the Gumbel distribution with biomedical data analysis, J. Radiat. Res. Appl. Sci., 17 (2024), 101055. https://doi.org/10.1016/j.jrras.2024.101055 doi: 10.1016/j.jrras.2024.101055
    [5] N. G. Nia, E. Kaplanoglu, A. Nasab, Evaluation of artificial intelligence techniques in disease diagnosis and prediction, Discover Artif. Intell., 3 (2023), 5. https://doi.org/10.1007/s44163-023-00049-5 doi: 10.1007/s44163-023-00049-5
    [6] B. Saravi, F. Hassel, S. Ülkümen, A. Zink, V. Shavlokhova, S. Couillard-Despres, et al., Artificial intelligence-driven prediction modeling and decision making in spine surgery using hybrid machine learning models, J. Pers. Med., 12 (2022), 509. https://doi.org/10.3390/jpm12040509 doi: 10.3390/jpm12040509
    [7] M. A. Ghorbani, R. Khatibi, V. Karimi, Z. M. Yaseen, M. Zounemat-Kermani, Learning from multiple models using artificial intelligence to improve model prediction accuracy: application to river flows, Water Resour. Manag., 32 (2018), 4201–4215. https://doi.org/10.1007/s11269-018-2038-x doi: 10.1007/s11269-018-2038-x
    [8] S. G. Nia, Appropriate combination of artificial intelligence and algorithms for increasing predictive accuracy management, J. Inf. Technol. Manag., 2 (2010), 157–174.
    [9] N. Alotaibi, A. S. Al-Moisheer, A. S. Hassan, I. Elbatal, S. A. Alyami, E. M. Almetwally, Epidemiological modeling of COVID-19 data with advanced statistical inference based on Type-Ⅱ progressive censoring, Heliyon, 10 (2024), e36774. https://doi.org/10.1016/j.heliyon.2024.e36774 doi: 10.1016/j.heliyon.2024.e36774
    [10] T. Hirayama, Epidemiology of breast cancer with special reference to the role of diet, Preventive Med., 7 (1978), 173–195. https://doi.org/10.1016/0091-7435(78)90244-X doi: 10.1016/0091-7435(78)90244-X
    [11] A. A. Suleiman, H. Daud, A. I. Ishaq, A. U. Farouk, A. S. Mohammed, M. Kayid, et al., A new statistical model for advanced modeling of cancer disease data, Kuwait J. Sci., 52 (2025), 100429. https://doi.org/10.1016/j.kjs.2025.100429 doi: 10.1016/j.kjs.2025.100429
    [12] S. Ferrari, F. Cribari-Neto, Beta regression for modelling rates and proportions, J. Appl. Stat., 31 (2004), 799–815. https://doi.org/10.1080/0266476042000214501 doi: 10.1080/0266476042000214501
    [13] R. Kieschnick, B. D. McCullough, Regression analysis of variates observed on (0, 1): percentages, proportions and fractions, Stat. Model., 3 (2003), 193–213. https://doi.org/10.1191/1471082X03st053oa doi: 10.1191/1471082X03st053oa
    [14] M. C. Korkmaz, A new heavy-tailed distribution defined on the bounded interval: the logit slash distribution and its application, J. Appl. Stat., 47 (2020), 2097–2119. https://doi.org/10.1080/02664763.2019.1704701 doi: 10.1080/02664763.2019.1704701
    [15] J. R. van Dorp, S. Kotz, The standard two-sided power distribution and its properties: with applications in financial engineering, Amer. Statist., 56 (2002), 90–99. https://doi.org/10.1198/000313002317572745 doi: 10.1198/000313002317572745
    [16] N. L. Johnson, Systems of frequency curves generated by methods of translation, Biometrika, 36 (1949), 149–176. https://doi.org/10.2307/2332539 doi: 10.2307/2332539
    [17] C. W. Topp, F. C. Leone, A family of J-shaped frequency functions, J. Amer. Statist. Assoc., 50 (1955), 209–219. https://doi.org/10.1080/01621459.1955.10501259 doi: 10.1080/01621459.1955.10501259
    [18] A. Pourdarvish, S. M. T. K. Mirmostafaee, K. Naderi, The exponentiated Topp-Leone distribution: properties and application, J. Appl. Environ. Biol. Sci., 5 (2015), 251–256.
    [19] A. Fayomi, A. S. Hassan, E. M. Almetwally, Inference and quantile regression for the unit-exponentiated Lomax distribution, PLoS One, 18 (2023), e0288635. https://doi.org/10.1371/journal.pone.0288635 doi: 10.1371/journal.pone.0288635
    [20] M. C. Korkmaz, C. Chesneau, On the unit Burr-XⅡ distribution with the quantile regression modeling and applications, Comput. Appl. Math., 40 (2021), 29. https://doi.org/10.1007/s40314-021-01418-5 doi: 10.1007/s40314-021-01418-5
    [21] R. A. R. Bantan, F. Jamal, C. Chesneau, M. Elgarhy, Theory and applications of the unit Gamma/Gompertz distribution, Mathematics, 9 (2021), 1850. https://doi.org/10.3390/math9161850 doi: 10.3390/math9161850
    [22] A. S. Hassan, A. Fayomi, A. Algarni, E. M. Almetwally, Bayesian and non-Bayesian inference for unit-exponentiated half-logistic distribution with data analysis, Appl. Sci., 12 (2022), 11253. https://doi.org/10.3390/app122111253 doi: 10.3390/app122111253
    [23] A. S. Mohammed, F. I. Ugwuowo, On transmuted exponential-Topp Leone distribution with monotonic and non-monotonic hazard rates and its applications, Reliab. Theory Appl., 16 (2021), 197–209.
    [24] A. S. Hassan, A. M. Khalil, H. F. Nagy, Data analysis and classical estimation methods of the bounded power Lomax distribution, Reliab. Theory Appl., 19 (2024), 770–789.
    [25] A. S. Hassan, R. S. Alharbi, Different estimation methods for the unit inverse exponentiated Weibull distribution, Commun. Stat. Appl. Methods, 30 (2023), 191–213.
    [26] A. Saeed, A. Saboor, F. Jamal, N. Alsadat, O. S. Balogun, A. Faal, et al., Bounded sine hyperbolic distribution with applications to real datasets, Kuwait J. Sci., 52 (2025), 100467. https://doi.org/10.1016/j.kjs.2025.100467 doi: 10.1016/j.kjs.2025.100467
    [27] R. C. Gupta, P. L. Gupta, R. D. Gupta, Modeling failure time data by Lehmann alternatives, Comm. Statist. Theory Methods, 27 (1998), 887–904. https://doi.org/10.1080/03610929808832134 doi: 10.1080/03610929808832134
    [28] N. Eugene, C. Lee, F. Famoye, Beta-normal distribution and its applications, Comm. Statist. Theory Methods, 31 (2002), 497–512. https://doi.org/10.1081/STA-120003130 doi: 10.1081/STA-120003130
    [29] G. M. Cordeiro, M. de Castro, A new family of generalized distributions, J. Stat. Comput. Simul., 81 (2011), 883–898. https://doi.org/10.1080/00949650903530745 doi: 10.1080/00949650903530745
    [30] I. A. Elbatal, T. S. Helal, A. M. Elsehetry, R. S. Elshaarawy, Topp-Leone Weibull generated family of distributions with applications, J. Bus. Environ. Sci., 1 (2022), 183–195.
    [31] G. R. Al-Dayian, A. El-Helbawy, F. Abd El-Maksoud, Bayesian and maximum likelihood estimation for mixture models of the new Topp-Leone-G family, J. Bus. Environ. Sci., 4 (2025), 210–253.
    [32] D. Soliman, M. A. Hegazy, G. R. Al-Dayian, A. A. El-Helbawy, Statistical properties and applications of a new truncated Zubair-generalized family of distributions, Comput. J. Math. Stat. Sci., 4 (2025), 222–257.
    [33] A. Alzaatreh, C. Lee, F. Famoye, A new method for generating families of continuous distributions, Metron, 71 (2013), 63–79. https://doi.org/10.1007/s40300-013-0007-y doi: 10.1007/s40300-013-0007-y
    [34] A. Yahaya, A. S. Mohammed, Transmuted Kumaraswamy-inverse exponential distribution: its properties and applications, Niger. J. Sci. Res., 16 (2017), 298–307.
    [35] M. C. Jones, Families of distributions arising from distributions of order statistics, Test, 13 (2004), 1–43. https://doi.org/10.1007/BF02602999 doi: 10.1007/BF02602999
    [36] M. Garg, On generalized order statistics from Kumaraswamy distribution, Tamsui Oxf. J. Math. Sci. (TOJMS), 25 (2009), 153–166.
    [37] A. I. Ishaq, A. A. Suleiman, H. Daud, N. S. S. Singh, M. Othman, R. Sokkalingam, et al., Log-Kumaraswamy distribution: its features and applications, Front. Appl. Math. Stat., 9 (2023), 1258961. https://doi.org/10.3389/fams.2023.1258961 doi: 10.3389/fams.2023.1258961
    [38] A. S. Hassan, E. M. Almetwally, G. M. Ibrahim, Kumaraswamy inverted Topp-Leone distribution with applications to COVID-19 data, Comput. Mater. Continua, 68 (2021), 337–358. https://doi.org/10.32604/cmc.2021.013971 doi: 10.32604/cmc.2021.013971
    [39] J. W. Boag, Maximum likelihood estimates of the proportion of patients cured by cancer therapy, J. R. Stat. Soc. Ser. B, 11 (1949), 15–53.
    [40] I. W. Burr, Cumulative frequency functions, Ann. Math. Stat., 13 (1942), 215–232.
    [41] A. I. Ishaq, A. A. Abiodun, A. A. Suleiman, A. Usman, A. S. Mohammed, M. Tasiu, Modelling Nigerian inflation rates from January 2003 to June 2023 using newly developed inverse power chi-square distribution, In: 2023 4th International Conference on Data Analytics for Business and Industry (ICDABI), Bahrain, 2023,644–651. https://doi.org/10.1109/ICDABI60145.2023.10629442
    [42] N. L. Johnson, S. Kotz, N. Balakrishnan, Beta distributions, In: Continuous univariate distributions, 2 Eds., John Wiley and Sons, 1994,221–235.
    [43] W. Weibull, A statistical distribution function of wide applicability, J. Appl. Mech., 18 (1951), 293.
    [44] P. Kumari, V. Goswami, H. Narasimhamurthy, R. S. Pundir, Recurrent neural network architecture for forecasting banana prices in Gujarat, India, PLoS One, 18 (2023), e0275702. https://doi.org/10.1371/journal.pone.0275702 doi: 10.1371/journal.pone.0275702
    [45] T. Wilberforce, A. Alaswad, A. Garcia-Perez, Y. C. Xu, X. H. Ma, C. Panchev, Remaining useful life prediction for proton exchange membrane fuel cells using combined convolutional neural network and recurrent neural network, Int. J. Hydrogen Energy, 48 (2023), 291–303. https://doi.org/10.1016/j.ijhydene.2022.09.207 doi: 10.1016/j.ijhydene.2022.09.207
    [46] W. Cai, X. D. Wen, C. E. Li, J. J. Shao, J. G. Xu, Predicting the energy consumption in buildings using the optimized support vector regression model, Energy, 273 (2023), 127188. https://doi.org/10.1016/j.energy.2023.127188 doi: 10.1016/j.energy.2023.127188
    [47] S. Makridakis, M. Hibon, ARMA models and the Box-Jenkins methodology, J. Forecast., 16 (1997), 147–163.
    [48] M. Ogunsanya, J. Isichei, S. Desai, Grid search hyperparameter tuning in additive manufacturing processes, Manufactur. Lett., 35 (2023), 1031–1042. https://doi.org/10.1016/j.mfglet.2023.08.056 doi: 10.1016/j.mfglet.2023.08.056
    [49] F. Kuran, G. Tanırcan, E. Pashaei, Performance evaluation of machine learning techniques in predicting cumulative absolute velocity, Soil Dyn. Earthq. Eng., 174 (2023), 108175. https://doi.org/10.1016/j.soildyn.2023.108175 doi: 10.1016/j.soildyn.2023.108175
  • Reader Comments
  • © 2026 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(120) PDF downloads(16) Cited by(0)

Article outline

Figures and Tables

Figures(10)  /  Tables(24)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog