Rating transition models are widely used for credit risk evaluation. It is not uncommon that a time-homogeneous Markov rating migration model will deteriorate quickly after projecting repeatedly for a few periods. This is because the time-homogeneous Markov condition is generally not satisfied. For a credit portfolio, the rating transition is usually path-dependent. In this paper, we propose a recurrent neural network (RNN) model for modeling path-dependent rating migration. An RNN is a type of artificial neural network where connections between nodes form a directed graph along a temporal sequence. There are neurons for input and output at each time period. The model is informed by the past behaviors for a loan along the path. Information learned from previous periods propagates to future periods. The experiments show that this RNN model is robust.
Citation: Bill Huajian Yang. Modeling path-dependent state transitions by a recurrent neural network[J]. Big Data and Information Analytics, 2022, 7: 1-12. doi: 10.3934/bdia.2022001
Rating transition models are widely used for credit risk evaluation. It is not uncommon that a time-homogeneous Markov rating migration model will deteriorate quickly after projecting repeatedly for a few periods. This is because the time-homogeneous Markov condition is generally not satisfied. For a credit portfolio, the rating transition is usually path-dependent. In this paper, we propose a recurrent neural network (RNN) model for modeling path-dependent rating migration. An RNN is a type of artificial neural network where connections between nodes form a directed graph along a temporal sequence. There are neurons for input and output at each time period. The model is informed by the past behaviors for a loan along the path. Information learned from previous periods propagates to future periods. The experiments show that this RNN model is robust.
| [1] |
Yang BH, Du Z, (2016) Rating transition probability models and CCAR stress testing. J Risk Model Validation 10: 1–19. https://doi.org/10.21314/JRMV.2016.155 doi: 10.21314/JRMV.2016.155
|
| [2] |
Yang BH, (2017) Forward ordinal models for point-in-time probability of default term structure. J Risk Model Validation 11: 1–18. https://doi.org/10.21314/JRMV.2017.181 doi: 10.21314/JRMV.2017.181
|
| [3] |
Dos Reis G, Pfeuffer M, Smith G, (2020) Capturing model risk and rating momentum in the estimation of probabilities of default and credit rating migrations. Quant Finance 20: 1069–1083. https://doi.org/10.1080/14697688.2020.1726439 doi: 10.1080/14697688.2020.1726439
|
| [4] |
Miu P, Ozdemir B, (2009) Stress testing probability of default and rating migration rate with respect to Basel Ⅱ requirements. J Risk Model Validation 3: 3–38. https://doi.org/10.21314/JRMV.2009.048 doi: 10.21314/JRMV.2009.048
|
| [5] | Kiefer NM, Larson CE, (2004) Testing Simple Markov Structures for Credit Rating Transitions. Comptroller of the Currency. |
| [6] |
Kiefer NM, Larson CE, (2007) A simulation estimator for testing the time homogeneity of credit rating transitions. J Empirical Finance 14: 818–835. https://doi.org/10.1016/j.jempfin.2006.08.001 doi: 10.1016/j.jempfin.2006.08.001
|
| [7] | Juhasz P, Vidovics-Dancs A, Szaz J, (2017) Measuring path dependency. UTMS J Econ 8: 29–37. |
| [8] |
Russo E, (2020) A discrete-time approach to evaluate path-dependent derivatives in a regime-switching risk model. Risks 8: 9. https://doi.org/10.3390/risks8010009 doi: 10.3390/risks8010009
|
| [9] |
Yang BH, (2017) Point-in-time PD term structure models for multi-period scenario loss projection. J Risk Model Validation 11: 73–94. https://doi.org/10.21314/JRMV.2017.164 doi: 10.21314/JRMV.2017.164
|
| [10] | Zhu S, Lomibao D, (2005) A conditional valuation approach for path-dependent instruments. SSRN Electron J. https://doi.org/10.2139/ssrn.806704 |
| [11] |
Graves A, Liwicki M, Fernández S, Bertolami R, Bunke H, Schmidhuber J, (2009) A novel connectionist system for unconstrained handwriting recognition. IEEE Trans Pattern Anal Mach Intell 31: 855–868. https://doi.org/10.1109/TPAMI.2008.137 doi: 10.1109/TPAMI.2008.137
|
| [12] |
Tealab A, (2018) Time series forecasting using artificial neural networks methodologies: A systemati review. Future Comput Inf J 3: 334–340. https://doi.org/10.1016/j.fcij.2018.10.003 doi: 10.1016/j.fcij.2018.10.003
|
| [13] | Hyötyniemi H, (1997) Proceedings of STeP'96, (eds. Jarmo Alander, Timo Honkela and Matti Jakobsson), Publications of the Finnish Artificial Intelligence Society, 13–24. |
| [14] |
Elman JL, (1990) Finding structure in time. Cognitive Sci 14: 179–211. https://doi.org/10.1016/0364-0213(90)90002-E doi: 10.1016/0364-0213(90)90002-E
|
| [15] |
Schmidhuber J, (2015) Deep learning in neural networks: An overview. Neural Networks 61: 85–117. https://doi.org/10.1016/j.neunet.2014.09.003 doi: 10.1016/j.neunet.2014.09.003
|
| [16] |
Abiodun OI, Jantan A, Omolara AE, Dada KV, Mohamed NA, Arshad H (2018), State-of-the-art in artificial neural network applications: A survey, Heliyon 4: e00938. https://doi.org/10.1016/j.heliyon.2018.e00938 doi: 10.1016/j.heliyon.2018.e00938
|
| [17] | Dupond S (2019), A thorough review on the current advance of neural network structures. Annu Rev Control 14: 200–230. |
| [18] | Bottou L, (2010) Large-scale machine learning with stochastic gradient descent, In Proceedings of COMPSTAT'2010 Physica-Verlag HD, 177–186. https://doi.org/10.1007/978-3-7908-2604-3_16 |
| [19] | Ge R, Huang F, Jin C, Yuan Y, (2015) Escaping from saddle points-online stochastic gradient for tensor decomposition, In Conference on Learning Theory, PMLR, 1–46. |