In survival analysis, the predictive accuracy of lifetime distributions is frequently compromised since the continuous observation assumption is violated, resulting in discretized covariate data. Conventional survival analysis estimators typically address this limitation by treating the observed measurement as the true state or by assuming a mutually independent error structure, complicating the functional predictor that systematically induces discrepancy and obscures particle risk heterogeneity. In this article we propose an alternative Bayesian hierarchical framework, the process-augmented lifetime distribution, which explicitly models the representation gap between the true latent exposure and the observed measurement as a structured, recoverable stochastic process. We construct a generative chain that decomposes the hazard function into an explicit channel spanned by the measurement basis and an orthogonal implicit channel that captures the unobserved variance. This spectral decomposition ensures the identifiability of the latent signal without requiring external validation data by enforcing rigorous orthogonal constraints. We further develop a process augmentation scheme that introduces an intermediate auxiliary variable, transforming the intractable non-Gaussian likelihood into a conjugate structure applicable to an exact Gibbs sampler. This computational formulation permits the derivation of novel posterior functional measurements, which partition the predictive variance into irreducible failure noise and reducible measurement uncertainty. We propose a simulation study and an empirical analysis of high-frequency market stability to confirm that the proposed estimator successfully corrects discrepancy error and resolves intra-bin heterogeneity, providing a robust approach for quantifying structural decision risk in systems subject to severe observation imperfections.
Citation: Xu Liu, Xufeng Niu. Process-augmented lifetime distributions: A generative Bayesian framework for latent structure inference in survival analysis[J]. AIMS Mathematics, 2026, 11(3): 6804-6833. doi: 10.3934/math.2026281
In survival analysis, the predictive accuracy of lifetime distributions is frequently compromised since the continuous observation assumption is violated, resulting in discretized covariate data. Conventional survival analysis estimators typically address this limitation by treating the observed measurement as the true state or by assuming a mutually independent error structure, complicating the functional predictor that systematically induces discrepancy and obscures particle risk heterogeneity. In this article we propose an alternative Bayesian hierarchical framework, the process-augmented lifetime distribution, which explicitly models the representation gap between the true latent exposure and the observed measurement as a structured, recoverable stochastic process. We construct a generative chain that decomposes the hazard function into an explicit channel spanned by the measurement basis and an orthogonal implicit channel that captures the unobserved variance. This spectral decomposition ensures the identifiability of the latent signal without requiring external validation data by enforcing rigorous orthogonal constraints. We further develop a process augmentation scheme that introduces an intermediate auxiliary variable, transforming the intractable non-Gaussian likelihood into a conjugate structure applicable to an exact Gibbs sampler. This computational formulation permits the derivation of novel posterior functional measurements, which partition the predictive variance into irreducible failure noise and reducible measurement uncertainty. We propose a simulation study and an empirical analysis of high-frequency market stability to confirm that the proposed estimator successfully corrects discrepancy error and resolves intra-bin heterogeneity, providing a robust approach for quantifying structural decision risk in systems subject to severe observation imperfections.
| [1] |
Y. Aït-Sahalia, J. Yu, High frequency market microstructure noise estimates and liquidity measures, Ann. Appl. Stat., 3 (2009), 422–457. http://dx.doi.org/10.1214/08-AOAS200 doi: 10.1214/08-AOAS200
|
| [2] |
H. Akaike, A new look at the statistical model identification, IEEE T. Automat. Contr., 19 (1974), 716–723. http://dx.doi.org/10.1109/TAC.1974.1100705 doi: 10.1109/TAC.1974.1100705
|
| [3] |
B. Alves, J. Dias, Survival mixture models in behavioral scoring, Expert Syst. Appl., 42 (2015), 3902–3910. http://dx.doi.org/10.1016/j.eswa.2014.12.036 doi: 10.1016/j.eswa.2014.12.036
|
| [4] |
J. Banasik, J. Crook, L. Thomas, Not if but when will borrowers default, J. Oper. Res. Soc., 50 (1999), 1185–1190. http://dx.doi.org/10.1057/palgrave.jors.2600851 doi: 10.1057/palgrave.jors.2600851
|
| [5] |
T. Bellotti, J. Crook, Forecasting and stress testing credit card default using dynamic models, Int. J. Forecasting, 29 (2013), 563–574. http://dx.doi.org/10.1016/j.ijforecast.2013.04.003 doi: 10.1016/j.ijforecast.2013.04.003
|
| [6] |
N. Benítez-Parejo, M. Rodríguez del Águila, S. Pérez-Vicente, Survival analysis and Cox regression, Allergol. Immunopath., 39 (2011), 362–373. http://dx.doi.org/10.1016/j.aller.2011.07.007 doi: 10.1016/j.aller.2011.07.007
|
| [7] | J. Bradley, C. Wikle, S. Holan, Hierarchical models for spatial data with errors that are correlated with the latent process, Stat. Sinica, 30 (2020), 81–109. |
| [8] |
J. Bradley, S. Zhou, X. Liu, Deep hierarchical generalized transformation models for spatio-temporal data with discrepancy errors, Spat. Stat., 55 (2023), 100749. http://dx.doi.org/10.1016/j.spasta.2023.100749 doi: 10.1016/j.spasta.2023.100749
|
| [9] |
J. Breeden, A. Bellotti, Y. Leonova, Instabilities in Cox proportional hazards models in credit risk, J. Credit Risk, 19 (2023), 29–55. http://dx.doi.org/10.21314/JCR.2022.014 doi: 10.21314/JCR.2022.014
|
| [10] |
S. Chaker, The signal and the noise volatilities, Res. Int. Bus. Financ., 50 (2019), 79–105. http://dx.doi.org/10.1016/j.ribaf.2019.04.008 doi: 10.1016/j.ribaf.2019.04.008
|
| [11] | C. L. Cheng, J. W. Van Ness, Statistical regression with measurement error, Kendall's library of statistics, vol. 6, Arnold, London; co-published by Oxford University Press, New York, 1999. MR1719513. |
| [12] |
D. Cox, Regression models and life-tables, J. Roy. Stat. Soc. B, 34 (1972), 187–202. http://dx.doi.org/10.1111/j.2517-6161.1972.tb00899.x doi: 10.1111/j.2517-6161.1972.tb00899.x
|
| [13] |
V. Djeundje, J. Crook, Identifying hidden patterns in credit risk survival data using generalised additive models, Eur. J. Oper. Res., 277 (2019), 366–376. http://dx.doi.org/10.1016/j.ejor.2019.02.006 doi: 10.1016/j.ejor.2019.02.006
|
| [14] |
L. Dirick, T. Bellotti, G. Claeskens, B. Baesens, Macro-economic factors in credit risk calculations: Including time-varying covariates in mixture cure models, J. Bus. Econ. Stat., 37 (2019), 40–53. http://dx.doi.org/10.1080/07350015.2016.1260471 doi: 10.1080/07350015.2016.1260471
|
| [15] |
R. F. Engle, J. R. Russell, Autoregressive conditional duration: A new model for irregularly spaced transaction data, Econometrica, 66 (1998), 1127–1162. http://dx.doi.org/10.2307/2999632 doi: 10.2307/2999632
|
| [16] |
J. J. Hanfelt, K. Y. Liang, Approximate likelihoods for generalized linear errors-in-variables models, J. Roy. Stat. Soc. B, 59 (1997), 627–637. https://doi.org/10.1111/1467-9868.00087 doi: 10.1111/1467-9868.00087
|
| [17] |
P. R. Hansen, A. Lunde, Realized variance and market microstructure noise, J. Bus. Econ. Stat., 24 (2006), 127–161. http://dx.doi.org/10.1198/073500106000000071 doi: 10.1198/073500106000000071
|
| [18] | X. He, M. L. T. Lee, First-hitting-time based threshold regression, Int. Encycl. Stat. Sci., Springer, Berlin, Heidelberg, 2011,523–524. http://dx.doi.org/10.1007/978-3-642-04898-2_252 |
| [19] | T. A. Hsieh, H. Choi, M. Kim, Multimodal representation loss between timed text and audio for regularized speech separation, arXiv preprint, 2024. |
| [20] |
M. Ibrahim, H. Goual, M. K. Khaoula, A. H. Al-Nefaie, A. M. AboAlkhair, H. M. Yousof, A novel accelerated failure time model with risk analysis under actuarial data, censored and uncensored application, Stat. Optim. Inform. Comput., 14 (2025), 1198–1225. http://dx.doi.org/10.19139/soic-2310-5070-2627 doi: 10.19139/soic-2310-5070-2627
|
| [21] |
C. Jiang, Z. Wang, H. Zhao, A prediction-driven mixture cure model and its application in credit scoring, Eur. J. Oper. Res., 277 (2019), 20–31. http://dx.doi.org/10.1016/j.ejor.2019.01.072 doi: 10.1016/j.ejor.2019.01.072
|
| [22] |
Y. Ju, S. Jeon, S. Sohn, Behavioral technology credit scoring model with time-dependent covariates for stress test, Eur. J. Oper. Res., 242 (2015), 910–919. http://dx.doi.org/10.1016/j.ejor.2014.10.054 doi: 10.1016/j.ejor.2014.10.054
|
| [23] | J. D. Kalbfleisch, R. L. Prentice, The statistical analysis of failure time data, Wiley Ser. Prob. Stat., John Wiley & Sons, 2Eds., 2002. http://dx.doi.org/10.1002/9781118032985.fmatter |
| [24] |
E. Kaplan, P. Meier, Nonparametric estimation from incomplete observations, J. Am. Stat. Assoc., 53 (1958), 457–481. http://dx.doi.org/10.2307/2281868 doi: 10.2307/2281868
|
| [25] | S. Konishi, G. Kitagawa, Information criteria and statistical modeling, Springer Series in Statistics, Springer, New York, 2008. http://dx.doi.org/10.1007/978-0-387-71887-3 |
| [26] |
I. Kuitunen, V. T. Ponkilainen, M. M. Uimonen, A. Eskelinen, A. Reito, Testing the proportional hazards assumption in cox regression and dealing with possible non-proportionality in total joint arthroplasty research: Methodological perspectives and review, BMC Musculoskel. Dis., 22 (2021), 489. http://dx.doi.org/10.1186/s12891-021-04379-2 doi: 10.1186/s12891-021-04379-2
|
| [27] |
M. L. T. Lee, G. A. Whitmore, Threshold regression for survival analysis: Modeling event times by a stochastic process, Stat. Sci., 21 (2006), 501–513. http://dx.doi.org/10.1214/088342306000000330 doi: 10.1214/088342306000000330
|
| [28] |
Z. Li, J. Crook, G. Andreeva, Y. Tang, Predicting the risk of financial distress using corporate governance measures, Pac.-Basin Financ. J., 68 (2021), 101334. http://dx.doi.org/10.1016/j.pacfin.2020.101334 doi: 10.1016/j.pacfin.2020.101334
|
| [29] |
S. Luo, X. Kong, T. Nie, Spline based survival model for credit risk modeling, Eur. J. Oper. Res., 253 (2016), 869–879. http://dx.doi.org/10.1016/j.ejor.2016.02.050 doi: 10.1016/j.ejor.2016.02.050
|
| [30] | D. Machin, Y. Cheung, M. Parmar, Survival analysis: A practical approach, John Wiley & Sons, Ltd, 2006. http://dx.doi.org/10.1002/0470034572 |
| [31] | G. Marinos, M. Karvounis, I. N. Athanasiadis, An innovative framework for threshold exceedance forecasting in timeseries using survival analysis, Knowledge Discovery, Knowledge Engineering and Knowledge Management (Communications in Computer and Information Science), 2454 (2025), 296–316. http://dx.doi.org/10.1007/978-3-031-87569-4_14 |
| [32] |
T. Moon, S. Sohn, Survival analysis for technology credit scoring adjusting total perception, J. Oper. Res. Soc., 62 (2011), 1159–1168. http://dx.doi.org/10.1057/jors.2010.80 doi: 10.1057/jors.2010.80
|
| [33] | B. Narain, Survival analysis and the credit granting decision, In: L. Thomas, D. Edelman, J. Crook (eds), Readings in Credit Scoring: Foundations, Developments, and Aims, Chapter 16, Oxford University Press, 1992,236–247. http://dx.doi.org/10.1093/oso/9780198527978.003.0022 |
| [34] |
L. H. Nghiem, M. C. Byrd, C. J. Potgieter, Estimation in linear errors-in-variables models with unknown error distribution, Biometrika, 107 (2020), 841–856. https://doi.org/10.1093/biomet/asaa025 doi: 10.1093/biomet/asaa025
|
| [35] |
H. Noh, T. Roh, I. Han, Prognostic personal credit risk model considering censored information, Expert Syst. Appl., 28 (2005), 753–762. http://dx.doi.org/10.1016/j.eswa.2004.12.032 doi: 10.1016/j.eswa.2004.12.032
|
| [36] |
M. Othus, B. Barlogie, M. L. LeBlanc, J. J. Crowley, Cure models as a useful statistical tool for analyzing survival, Clin. Cancer Res., 18 (2012), 3731–3736. 10.1158/1078-0432.CCR-11-2859 doi: 10.1158/1078-0432.CCR-11-2859
|
| [37] |
D. J. Spiegelhalter, N. G. Best, B. P. Carlin, A. van der Linde, Bayesian measures of model complexity and fit, J. Roy. Stat. Soc. B, 64 (2002), 583–639. http://dx.doi.org/10.1111/1467-9868.00353 doi: 10.1111/1467-9868.00353
|
| [38] |
E. Tong, C. Mues, L. Thomas, Mixture cure models in credit scoring: If and when borrowers default, Eur. J. Oper. Res., 218 (2012), 132–139. http://dx.doi.org/10.1016/j.ejor.2011.10.007 doi: 10.1016/j.ejor.2011.10.007
|
| [39] |
P. Wang, Y. Li, C. Reddy, Machine learning for survival analysis: A survey, ACM Comput. Surv., 51 (2019), 110. http://dx.doi.org/10.1145/3214306 doi: 10.1145/3214306
|
| [40] |
Z. Wang, C. Jiang, Y. Ding, X. Lyu, Y. Liu, A novel behavioral scoring model for estimating probability of default over time in peer-to-peer lending, Electronic Commer. Res. Appl., 27 (2018), 74–82. http://dx.doi.org/10.1016/j.elerap.2017.12.006 doi: 10.1016/j.elerap.2017.12.006
|
| [41] |
C. Zheng, J. Zhu, X. Fan, S. Chen, Z. Zhang, Promoting variable effect consistency in mixture cure model for credit scoring, Discrete Dyn. Nat. Soc., 2022 (2022), 3112987. http://dx.doi.org/10.1155/2022/3112987 doi: 10.1155/2022/3112987
|