Monotone-constrained functional component analysis for forward risk-accumulation curves with Leak-Free temporal validation and regime classification

Çağlar SÖZEN; Çağlar SÖZEN

doi:10.3934/DSFE.2026012

Data Science in Finance and Economics

2026, Volume 6, Issue 2: 336-358. doi: 10.3934/DSFE.2026012

Previous Article Next Article

Research article

Monotone-constrained functional component analysis for forward risk-accumulation curves with Leak-Free temporal validation and regime classification

Çağlar SÖZEN ^, ,

Department of Finance and Banking, Giresun University, Giresun, Turkey

Received: 17 November 2025 Revised: 02 April 2026 Accepted: 16 April 2026 Published: 01 June 2026
JEL Codes: C38, C58, G17

I studied whether a monotone-constrained functional representation could improve regime classification for forward risk-accumulation curves under a time-respecting empirical design. The empirical application focused on Apple Inc. (AAPL) and used a supportive tech-growth panel consisting of AAPL, MSFT, NVDA, AMZN, QQQ, XLK, SPY, XLC, XLY, and XLI for descriptive context. I constructed forward curves over a 60-day horizon and compared cumulative forward variance with root-cumulative forward volatility, taking the latter as the main empirical specification. To represent these structurally monotone curves, I implemented a monotone-constrained component extraction procedure, denoted mFPCA for convenience, based on isotonic projection and sequential deflation, and evaluated it against an unconstrained FPCA benchmark under identical classifiers and validation schemes. All transformations, thresholds, and tuning steps were estimated on the training slice only and then transferred unchanged to the evaluation slice. Under blocked validation, the preferred AAPL specification attained Accuracy $ 0.862 $ and Macro-F1 $ 0.863 $; under rolling walk-forward evaluation, the corresponding values were $ 0.815 $ and $ 0.822 $. Relative to FPCA, the gains from mFPCA were positive but modest. Moreover, ablation results showed that cone depth did not improve the final AAPL specification, while sensitivity analysis supported $ K = 3 $ as the main component choice. Permutation importance further indicated that predictive information was concentrated primarily in the early-to-middle portion of the forward curve. Simulation evidence showed that mFPCA was more closely aligned with latent monotone structure than FPCA when judged by matched component angles, and that random $ K $-fold validation remained mildly optimistic relative to blocked validation under temporally dependent regime-shift designs. Overall, the results supported monotone-constrained representation as a credible and structurally coherent approach to forward risk classification when combined with leak-free temporal evaluation.
- functional data analysis,
- monotone-constrained representation,
- forward risk-accumulation curves,
- temporal cross-validation,
- leak-free evaluation,
- regime classification
Citation: Çağlar SÖZEN. Monotone-constrained functional component analysis for forward risk-accumulation curves with Leak-Free temporal validation and regime classification[J]. Data Science in Finance and Economics, 2026, 6(2): 336-358. doi: 10.3934/DSFE.2026012

Related Papers:

Abstract

I studied whether a monotone-constrained functional representation could improve regime classification for forward risk-accumulation curves under a time-respecting empirical design. The empirical application focused on Apple Inc. (AAPL) and used a supportive tech-growth panel consisting of AAPL, MSFT, NVDA, AMZN, QQQ, XLK, SPY, XLC, XLY, and XLI for descriptive context. I constructed forward curves over a 60-day horizon and compared cumulative forward variance with root-cumulative forward volatility, taking the latter as the main empirical specification. To represent these structurally monotone curves, I implemented a monotone-constrained component extraction procedure, denoted mFPCA for convenience, based on isotonic projection and sequential deflation, and evaluated it against an unconstrained FPCA benchmark under identical classifiers and validation schemes. All transformations, thresholds, and tuning steps were estimated on the training slice only and then transferred unchanged to the evaluation slice. Under blocked validation, the preferred AAPL specification attained Accuracy $ 0.862 $ and Macro-F1 $ 0.863 $; under rolling walk-forward evaluation, the corresponding values were $ 0.815 $ and $ 0.822 $. Relative to FPCA, the gains from mFPCA were positive but modest. Moreover, ablation results showed that cone depth did not improve the final AAPL specification, while sensitivity analysis supported $ K = 3 $ as the main component choice. Permutation importance further indicated that predictive information was concentrated primarily in the early-to-middle portion of the forward curve. Simulation evidence showed that mFPCA was more closely aligned with latent monotone structure than FPCA when judged by matched component angles, and that random $ K $-fold validation remained mildly optimistic relative to blocked validation under temporally dependent regime-shift designs. Overall, the results supported monotone-constrained representation as a credible and structurally coherent approach to forward risk classification when combined with leak-free temporal evaluation.

References

[1]	Andersen TG, Bollerslev T, Diebold FX, et al. (2003) Modeling and forecasting realized volatility. Econometrica 71: 579–625. https://doi.org/10.1111/1468-0262.00418 doi: 10.1111/1468-0262.00418
[2]	Arlot S, Celisse A (2010) A survey of cross-validation procedures for model selection. Stat Surv 4: 40–79. https://doi.org/10.1214/09-SS054 doi: 10.1214/09-SS054
[3]	Barlow RE, Bartholomew DJ, Bremner JM, et al. (1972) Statistical inference under order restrictions. Wiley, New York.
[4]	Barndorff-Nielsen OE, Shephard N (2002) Econometric analysis of realised volatility and its use in estimating stochastic volatility models. J R Stat Soc B 64: 253–280. https://doi.org/10.1111/1467-9868.00336 doi: 10.1111/1467-9868.00336
[5]	Bergmeir C, Benítez JM (2012) On the use of cross-validation for time series predictor evaluation. Inf Sci 191: 192–213. https://doi.org/10.1016/j.ins.2011.12.028 doi: 10.1016/j.ins.2011.12.028
[6]	Cerqueira V, Torgo L, Mozetič I (2020) Evaluating time series forecasting models: an empirical study on cross-validation and windows. Int J Forecast 36: 30–44. https://doi.org/10.1007/s10994-020-05910-7 doi: 10.1007/s10994-020-05910-7
[7]	Chiou JM (2012) Dynamical functional prediction and classification, with application to traffic flow prediction. Ann Appl Stat 6: 1588–1614. https://doi.org/10.1214/12-AOAS595 doi: 10.1214/12-AOAS595
[8]	Corsi F (2009) A simple approximate long-memory model of realized volatility. J Financ Econometrics 7: 174–196. https://doi.org/10.1093/jjfinec/nbp001 doi: 10.1093/jjfinec/nbp001
[9]	Cuevas A, Febrero M, Fraiman R (2007) Robust estimation and classification for functional data via projection-based depth notions. Comput Stat 22: 481–496. https://doi.org/10.1007/s00180-007-0053-0 doi: 10.1007/s00180-007-0053-0
[10]	Davis J, Goadrich M (2006) The relationship between precision–recall and ROC curves. In: Proceedings of the 23rd International Conference on Machine Learning (ICML), 233–240. https://doi.org/10.1145/1143844.1143874
[11]	Hansen PR, Huang Z, Shek HH (2012) Realized GARCH: a joint model for returns and realized measures of volatility. J Appl Econometrics 27: 877–906. https://doi.org/10.1002/jae.1234 doi: 10.1002/jae.1234
[12]	He H, Garcia EA (2009) Learning from imbalanced data. IEEE Trans Knowl Data Eng 21: 1263–1284. https://doi.org/10.1109/TKDE.2008.239 doi: 10.1109/TKDE.2008.239
[13]	Horváth L, Kokoszka P (2012) Inference for functional data with applications. Springer, New York.
[14]	Hsing T, Eubank RL (2015) Theoretical foundations of functional data analysis, with an introduction to linear operators. Wiley, Chichester.
[15]	James GM, Hastie TJ, Sugar CA (2000) Principal component models for sparse functional data. Biometrika 87: 587–602. https://doi.org/10.1093/biomet/87.3.587 doi: 10.1093/biomet/87.3.587
[16]	López-Pintado S, Romo J (2009) On the concept of depth for functional data. J Amer Stat Assoc 104: 718–734.
[17]	Pya N, Wood SN (2015) Shape constrained additive models. Stat Comput 25: 543–559. https://doi.org/10.1007/s11222-013-9448-7 doi: 10.1007/s11222-013-9448-7
[18]	Ramsay JO (1988) Monotone regression splines in action. Stat Sci 3: 425–441. https://doi.org/10.1214/ss/1177012761 doi: 10.1214/ss/1177012761
[19]	Ramsay JO, Silverman BW (2005) Functional data analysis, 2nd ed. Springer, New York.
[20]	Rice J, Wu CO (2001) Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics 57: 253–259. https://doi.org/10.1111/j.0006-341X.2001.00253.x doi: 10.1111/j.0006-341X.2001.00253.x
[21]	Roberts DR, Bahn V, Ciuti S, et al. (2017) Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40: 913–929. https://doi.org/10.1111/ecog.02881 doi: 10.1111/ecog.02881
[22]	Saito T, Rehmsmeier M (2015) The precision–recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLOS ONE 10: e0118432. https://doi.org/10.1371/journal.pone.0118432 doi: 10.1371/journal.pone.0118432
[23]	Sun Y, Genton MG (2011) Functional boxplots. J Comput Graph Stat 20: 316–334. https://doi.org/10.1198/jcgs.2011.09224 doi: 10.1198/jcgs.2011.09224
[24]	Tashman LJ (2000) Out-of-sample tests of forecasting accuracy: an analysis and review. Int J Forecast 16: 437–450. https://doi.org/10.1016/S0169-2070(00)00065-0 doi: 10.1016/S0169-2070(00)00065-0
[25]	Yahoo Finance (2025) Historical data for equities and ETFs. Available from: https://finance.yahoo.com/.
[26]	Robertson T, Wright FT, Dykstra RL (1988) Order restricted statistical inference. Wiley, New York.
[27]	Zuo Y, Serfling R (2000) General notions of statistical depth function. Ann Stat 28: 461–482. https://doi.org/10.1214/aos/1016218226 doi: 10.1214/aos/1016218226
[28]	Friedman J, Hastie T, Tibshirani R (2010) Regularization paths for generalized linear models via coordinate descent. J Stat Softw 33: 1–22. https://doi.org/10.18637/jss.v033.i01 doi: 10.18637/jss.v033.i01

Reader Comments

Your name:*

Email:*
© 2026 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)