1. Introduction
The increasing availability of large amounts of data allows researchers to model and forecast more accurately in many fields (e.g., see Choi and Varian, 2012; Varian, 2014; Varian and Scott, 2014; Einav and Levin, 2014). However, the main issues when dealing with high-dimensional models for large datasets are over-parametrization, over-fitting, and large out-of-sample forecasting errors (Granger, 1998). Various solutions have been proposed, such as regularization (Zou and Hastie, 2005), stochastic search variable selection (George et al., 2008), graphical models (Ahelegbey et al., 2016a, 2016b), and random projections (Koop et al., 2017; Casarin and Veggente, 2021). This paper considers factor models (Stock and Watson, 2002, 2004, 2005, 2012, 2014; Banbura et al., 2010; Casarin et al., 2020; Billio et al., 2022), in which the relevant information is summarized through a limited number of factors that describe the overall economic conditions and provide accurate forecasts of the variables of interest.
It has been shown that factor model estimates can be heavily affected by outliers, that is, data points that differ significantly from the other observations in the sample. An outlier may be due to variability in the measurement or to significant experimental errors; the latter are sometimes excluded from the dataset. After the 2009 crisis and the COVID-19 pandemic, the treatment of outliers attracted the attention of both researchers and official statistical institutes, which provided guidelines on monitoring the effects of outliers when using their data (e.g., see Eurostat, 2020). In this paper we follow Artis et al. (2005), Croux et al. (2003), Bai et al. (2022), and Fan et al. (2021) and apply robust estimation methods to factor models to limit the effects of the outliers. We contribute to the robust factor literature by comparing alternative robust factor models in terms of their forecasting performance on a set of variables which are central to economic analysis. Our dataset includes the 2009 crisis and the beginning of the COVID-19 pandemic in March 2020, with January 2021 as the last observation; the pandemic is potentially the most important source of outliers, and its effects on economic systems have been extensively investigated in some recent studies (Fabeil et al., 2020; Fernandes, 2020; McKibbin and Vines, 2020; McKibbin and Fernando, 2021; Liu, 2021). We note that the amount of sample information is not large enough to estimate forecasting models with structural breaks, since adopting them implies that the current model is estimated only on the data observed since the most recent break. Similarly, it is not possible to test for a break and compare the two models for the periods before and after the pandemic, since the spread of the contagion and its effects had not yet come to an end. This paper provides an alternative solution and shows that samples from the pandemic period have some information content which can still be used to estimate models without breaks, provided a proper inference technique, such as robust inference for outliers, is applied.
The structure of the paper is as follows. Section 2 presents some background on robust inference for outliers. Section 3 introduces the standard factor model and the two methodologies used to treat the outliers. Section 4 provides a data description and the empirical results obtained with robust inference methods for factor models. Section 5 concludes the chapter.
2. Background on robust estimation
The true nature of outliers can be very elusive, and dealing with data affected by outliers poses some challenges. There is no unanimous definition of what an outlier is. Outliers could be atypical samples that have an unusually large influence on the estimated model parameters. They could also be perfectly valid samples from the same distribution as the rest of the data that happen to be small-probability instances. Alternatively, outliers could be samples drawn from a different model, in which case they will likely not be consistent with the model derived from the rest of the data. There is no way to tell which is the case for a particular "outlying" sample point; nevertheless, some techniques can be applied to detect outliers. A standard procedure makes use of the linear projection of the dependent variable onto the linear space spanned by the covariates, through the hat matrix of the data. The diagonal of the hat matrix is used to detect outlying observations that may have an impact on the inference. Usually, outliers are excluded from the dataset when estimating the model (data trimming); see, for example, Davidson and MacKinnon (2004). In this paper, we compare trimming with two alternative approaches.
The first approach is based on Mahalanobis distances and can be applied for both detection and robust estimation. We consider robust estimators of multivariate location and scatter computed from the explanatory variables. Many methods for estimating multivariate location and scatter break down in the presence of T/(n+1) outliers, where T is the number of observations and n is the number of variables, as pointed out by Donoho (1982). For the breakdown value of the multivariate M-estimators of Maronna (1976), see Hampel et al. (1986). Since then, several positive-breakdown estimators of multivariate location and scatter have been proposed. The Minimum Covariance Determinant (MCD) is a highly robust estimator of multivariate location and scatter (Rousseeuw, 1984; Rousseeuw and Leroy, 1987) which uses only the subset of observations whose covariance matrix has the lowest determinant. Consistency and asymptotic normality of the MCD estimator have been shown by Butler et al. (1993) and Cator and Lopuhaä (2010), whereas the Minimum Volume Ellipsoid (MVE) estimator has been shown to have a lower convergence rate (Davies, 1992). The MCD has a bounded influence function (Croux and Haesbroeck, 1999) and attains the highest possible breakdown value (i.e., 50%) when the number of observations used is ⌊(T+n+1)/2⌋ (Lopuhaä and Rousseeuw, 1991). In addition to being highly resistant to outliers, the MCD is affine equivariant, i.e., the estimates behave properly under affine transformations of the data. Although the MCD was introduced in 1984, its practical use only became feasible with the computationally efficient Fast MCD (FMCD) algorithm of Rousseeuw and Van Driessen (1999), and several extensions have been proposed (Hubert et al., 2017); in this paper we follow the FMCD technique. The MCD has been successfully applied in many fields, such as finance and econometrics (Gambacciani and Paolella, 2017; Orhan et al., 2001), quality control (Jensen et al., 2007), geophysics (Neykov et al., 2007), geochemistry (Filzmoser et al., 2005), and image analysis (Vogler et al., 2007). The MCD has been used for robust factor model estimation by Croux et al. (2003) and Filzmoser et al. (2003).
The second approach is the Iteratively Reweighted Least Squares (IRLS) procedure proposed by De la Torre and Black (2004), which relies on the residuals of the linear projection of the dependent variable onto a space generated by a set of factors. The outliers are detected as the observations with a large residual with respect to the identified subspace. A new subspace is then estimated with the outliers downweighted, and this process is repeated until the estimated model stabilizes. With this algorithm, a weight is determined iteratively for every multivariate sample, and the weights related to the outliers are reduced until the procedure converges. This technique has been used for outlier reduction (Bergstrom and Edlund, 2014), for observations afflicted by outliers (Kargoll et al., 2018), and in forecasting (Mbamalu et al., 1993). Other applications are statistical estimation (Green, 1984), matrix rank minimization (Mohan and Fazel, 2012), and sparse recovery (Daubechies et al., 2009).
3. Factor models
In the following, we introduce Factor Models (FM), data trimming, and three approaches to outlier handling: i) the standard FM (FM Std), where all data are included without any transformation; ii) the Fast Minimum Covariance Determinant methodology combined with FM (FM FMCD); and iii) the Iterated Reweighted Least Squares combined with FM (FM IRLS).
In the empirical analysis, a Vector Autoregressive (VAR) model is used for predicting the factors and, according to Lütkepohl (2005), series without unit roots should be used when forecasting with VARs. To meet this requirement for the factors, we perform an Augmented Dickey-Fuller (ADF) unit-root test on all variables included in the factor analysis. When necessary, variables have been differenced to obtain stationary time series; after this step, we normalize the series and extract the factors. Thus, in the following we assume our T×n data matrix X is covariance stationary with zero mean and unit standard deviation.
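For concreteness, this pre-processing step can be sketched in Python as follows. This is a minimal illustration, not the exact implementation used in the paper: the DataFrame name raw, the 5% significance level, and the cap of two differences are our own choices.

```python
# A minimal sketch of the pre-processing described above: ADF unit-root
# tests, differencing of non-stationary series, and standardization.
import pandas as pd
from statsmodels.tsa.stattools import adfuller

def make_stationary(raw: pd.DataFrame, alpha: float = 0.05) -> pd.DataFrame:
    out = {}
    for col in raw.columns:
        series = raw[col].dropna()
        d = 0
        # adfuller returns (statistic, p-value, ...); difference until the
        # unit-root null is rejected, at most twice (our own cap)
        while d < 2 and adfuller(series.dropna())[1] >= alpha:
            series = series.diff()
            d += 1
        out[col] = series
    X = pd.DataFrame(out).dropna()
    return (X - X.mean()) / X.std()   # zero mean, unit standard deviation
```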
3.1. A standard factor model
In this paper we use factor models (see, e.g., Stock and Watson, 2002, 2004, 2005, 2012, 2014; Banbura et al., 2010; Banbura et al., 2014; Artis et al., 2005) with a reduced number of factors (Bai and Ng, 2002). See Diebold (2003) and Stock and Watson (2009) for reviews of factor models.
Let Xt, t > 0, be a random process with Xt=(x1t,…,xnt)′ an (n×1) random vector. The time index t represents months or quarters, and we assume the process is covariance stationary with zero mean and a standard deviation equal to one. Latent factor extraction relies on the following decomposition:

E[XtX′t]ai = λiai,  i = 1,…,n, (1)
where ai and λi, i=1,…,n, are the n-dimensional eigenvectors and the eigenvalues in decreasing order, respectively. Let A be the (n×n) orthonormal matrix with the normalized eigenvectors in its columns, also called the factor loading matrix; then:

E[XtX′t] = AΛA′, (2)
where Λ is a diagonal matrix with elements λi, i=1,…,n, on the main diagonal. The vector of n factors Fn,t=(f1,t,…,fn,t)′ is given by the linear transformation:

Fn,t = A′Xt, (3)
and fk,t, t>0, is the k-th factor. Let us denote with Γn the expectation of the outer product of the factors, E[Fn,tF′n,t]; then one obtains the following relationship between Γn and the eigenvalue matrix Λ:

Γn = A′E[XtX′t]A = Λ. (4)
Let Fk,t=(f1,t,…,fk,t)′ be the collection of the first k factors at time t, with k<n; then:

Fk,t = A′kXt, (5)
where Ak is the matrix containing the first k columns of A. Since the columns of A are orthonormal, A′kAk=Ik. The first k factors capture the following proportion of the total variance:

Vk = (λ1 + ⋯ + λk)/(λ1 + ⋯ + λn). (6)
The model based on the collection of factors Fk,t is customarily called the standard FM (FM Std).
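A minimal Python sketch of the extraction in Eqs. (1)-(6) is given below, assuming the pre-processed data are stored in a T×n numpy array X; the function and variable names are ours. The defaults mirror the choices discussed in Section 4.2 (at least 80% of the variance with no more than 9 factors).

```python
# A minimal sketch of Eqs. (1)-(6): eigendecomposition of the sample
# covariance, factors as linear combinations of the data, and selection
# of k so that at least a share var_target of the variance is explained.
import numpy as np

def extract_factors(X: np.ndarray, var_target: float = 0.8, k_max: int = 9):
    T, n = X.shape
    Sigma = X.T @ X / T                   # sample analogue of E[XtXt']
    lam, A = np.linalg.eigh(Sigma)        # eigh returns ascending eigenvalues
    order = np.argsort(lam)[::-1]
    lam, A = lam[order], A[:, order]      # eigenvalues in decreasing order, Eq. (1)
    V = np.cumsum(lam) / lam.sum()        # explained variance Vk, Eq. (6)
    k = min(int(np.searchsorted(V, var_target)) + 1, k_max)
    F = X @ A[:, :k]                      # factors Fk,t = Ak'Xt, Eq. (5)
    return F, A[:, :k], lam, k
```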
3.2. A robust factor model: the fast minimum covariance determinant estimator
In n-variate data, n > 2, it is difficult to detect outliers because one can no longer rely on visual inspection; nevertheless, a set of summary statistics can be used. One of the statistics used in the literature is the Mahalanobis distance:

D(xt, ˆμ, ˆΣ) = ((xt − ˆμ)′ˆΣ⁻¹(xt − ˆμ))^(1/2),  t = 1,…,T,
where xt is the t-th row of the data matrix X, ˆμ is the estimator of the location, and ˆΣ is the covariance matrix estimator. Using this distance, one obtains the classical tolerance ellipse, defined as the set of n-dimensional points x whose distance D(x, ˆμ, ˆΣ) equals the square root of the 0.975 quantile of the χ² distribution with n degrees of freedom. Detecting outliers by means of the Mahalanobis distance no longer suffices in the presence of multiple outliers because of the masking effect, by which multiple outliers do not necessarily have large Mahalanobis distances (Hubert et al., 2017). We consider a robust estimator of multivariate location and scatter based on the notion of Minimum Covariance Determinant (MCD) (Rousseeuw, 1984; Rousseeuw and Leroy, 1987; Hubert et al., 2017). In the MCD, only the r observations, ⌊(T+n+1)/2⌋ ≤ r ≤ T, whose classical covariance matrix has the lowest determinant are considered in the computation of the Mahalanobis distance:

dMCD,t = D(xt, ˆμMCD, ˆΣMCD),  t = 1,…,T,
where ˆμMCD and ˆΣMCD are the MCD estimators of the mean and the covariance matrix, respectively, defined as follows:

ˆμMCD = (∑t W(dt²)xt)/(∑t W(dt²)),  ˆΣMCD = c1(1/T)∑t W(dt²)(xt − ˆμMCD)(xt − ˆμMCD)′,
where the sums run over t = 1,…,T, W(dt²) is an appropriate weight function, and c1 is a consistency factor (e.g., see Lopuhaä and Rousseeuw, 1991). Note that the MCD estimator can only be computed when r > n, otherwise the covariance matrix of any r-subset has determinant 0, so we need at least T > 2n. To avoid excessive noise, it is recommended that T > 5n, so that we have at least five observations per dimension.
The MCD estimator is computationally expensive, as it requires the evaluation of all T!/(r!(T−r)!) subsets of size r; for this reason, we use the Fast Minimum Covariance Determinant (FMCD) estimator of Rousseeuw and Van Driessen (1999). A major component of the FMCD algorithm is the concentration step (C-step), which works as follows. Given the initial estimates ˆμold and ˆΣold:
● Compute the distances dold(t) = D(xt, ˆμold, ˆΣold), t = 1,…,T.
● Sort these distances, yielding a permutation τ such that dold(τ(1)) ≤ dold(τ(2)) ≤ ⋯ ≤ dold(τ(T)).
● Compute the location and scatter estimators from the r observations with the smallest distances:

ˆμnew = (1/r)∑i=1,…,r xτ(i),  ˆΣnew = (1/r)∑i=1,…,r (xτ(i) − ˆμnew)(xτ(i) − ˆμnew)′.
Theorem 1 of Rousseeuw and Van Driessen (1999) proves that det(ˆΣnew) ≤ det(ˆΣold), with equality only if ˆΣnew = ˆΣold. Thus, if we iterate the C-step, the sequence of determinants obtained in this way converges in a finite number of iterations. The FMCD algorithm supplies a sequence of binary weights of length T, equal to zero for the outliers and one otherwise; we replicate this sequence across the n columns to obtain the matrix HMCD of dimension T×n and multiply the data matrix X by HMCD:

XMCD = X ⊙ HMCD,
where ⊙ denotes the Hadamard product. We use XMCD to extract the factors Fk,t (Croux et al., 2003; Pison et al., 2003), as described in Section 3.1, and obtain the FM FMCD model. In this paper we set r = 0.95T.
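As an illustration, this pre-step can be sketched with scikit-learn's MinCovDet, which implements the FMCD algorithm; the masking step and the function name are our own construction, not the paper's code.

```python
# A hedged sketch of the FM FMCD pre-step: run FastMCD via scikit-learn's
# MinCovDet and zero out the flagged rows through the Hadamard product
# with HMCD. support_fraction = 0.95 mirrors the choice r = 0.95T.
import numpy as np
from sklearn.covariance import MinCovDet

def fmcd_mask(X: np.ndarray, support_fraction: float = 0.95) -> np.ndarray:
    mcd = MinCovDet(support_fraction=support_fraction).fit(X)
    w = mcd.support_.astype(float)               # 1 for retained rows, 0 for outliers
    HMCD = np.tile(w[:, None], (1, X.shape[1]))  # weights repeated over the n columns
    return X * HMCD                              # XMCD = X ⊙ HMCD
```

The masked matrix can then be passed to the factor extraction of Section 3.1, e.g. extract_factors(fmcd_mask(X)) in the sketch above.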
3.3. A robust factor model: the iterated reweighted least squares estimator
The Maximum-Likelihood-Type Estimator (M-estimator) is another popular robust method for estimating the location and scale of a set of points, and its application leads to the Iteratively Reweighted Least Squares (IRLS) algorithm (Bergstrom and Edlund, 2014; Daubechies et al., 2009). Define the residual as follows:

εt = ||xt − AkFk,t||2,
where Ak and Fk,t have been defined in Section 3.1, and ||⋅||2 is the Euclidean norm for vectors. IRLS assigns each observation a continuous weight that is a function of its residual, based on a given robust loss function ρ(⋅) mapping the reals to the positive reals. The objective function then becomes:

minAk,Fk,t ∑t=1,…,T ρ(εt).
Many loss functions have been proposed in the statistics literature (Huber, 1981; Barnett and Lewis, 1984). When ρ(εt) = εt², all weights are equal to 1 and we obtain the standard least-squares solution, which is not robust. Other robust loss functions are described in Vidal et al. (2016); in this work we use the Geman-McClure loss (Geman and McClure, 1987):

ρ(εt) = εt²/(εt² + ε0²),
where ε0² is a scale parameter that we set equal to the square root of the mean of the εt². Following De la Torre and Black (2004), we use the Geman-McClure loss scaled by ε0², which yields the following procedure. Given an initial parameter ε0² and the factor loadings and factors, Ak and Fk,t, respectively, obtained from the FM Std of Section 3.1, iterate the following steps until convergence:
1. Compute the residuals εt = ||xt − AkFk,t||2.
2. Compute the weights wt = ε0²/(ε0² + εt²).
3. Estimate the covariance Σ ← (∑t=1,…,T wt xt x′t)/(∑t=1,…,T wt).
4. Extract the k largest eigenvalues of Σ and collect the corresponding eigenvectors in Ak.
The model based on the factor matrix Fk,t obtained in this way is called FM IRLS.
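A minimal numpy sketch of steps 1-4 follows. The initialization mirrors the FM Std of Section 3.1; the projector-based stopping rule and the iteration cap are our own assumptions, not part of the original algorithm.

```python
# A minimal sketch of the IRLS iteration above, for a T x n standardized
# data matrix X and a fixed number of factors k.
import numpy as np

def fm_irls(X: np.ndarray, k: int, n_iter: int = 100, tol: float = 1e-8):
    T, n = X.shape
    lam, A = np.linalg.eigh(X.T @ X / T)            # FM Std initialization
    Ak = A[:, np.argsort(lam)[::-1][:k]]
    for _ in range(n_iter):
        F = X @ Ak                                   # factors Fk,t = Ak'xt
        eps = np.linalg.norm(X - F @ Ak.T, axis=1)   # step 1: residuals
        eps0_sq = np.sqrt(np.mean(eps ** 2))         # scale parameter eps0^2
        w = eps0_sq / (eps0_sq + eps ** 2)           # step 2: Geman-McClure weights
        Sigma = (X * w[:, None]).T @ X / w.sum()     # step 3: weighted covariance
        lam, A = np.linalg.eigh(Sigma)
        Ak_new = A[:, np.argsort(lam)[::-1][:k]]     # step 4: top-k eigenvectors
        # compare subspaces through their projectors to ignore sign flips
        if np.linalg.norm(Ak_new @ Ak_new.T - Ak @ Ak.T) < tol:
            Ak = Ak_new
            break
        Ak = Ak_new
    return X @ Ak, Ak
```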
3.4. A forecasting model
Once the factors are extracted, a forecasting procedure is needed to predict the variables of interest. We assume the first k latent factors (determined with the three methodologies described above), Fk,t=(f1,t,…,fk,t)′, with k<n, follow a VAR model. Using only k factors, the reconstruction of the variables derives from the approximated model:

Xk,t = AkFk,t.
The term Xk,t denotes the approximation of the vector Xt obtained using the first k factors Fk,t. The dynamics of the k factors are modeled as follows:

Fk,t = ck + ΦkFk,t−1 + ηt,
where ck has dimension k×1, Φk has dimension k×k, and ηt is a k-dimensional error term. As shown in Billio et al. (2022), under the VAR assumption for the factors, the variables of interest Xk,t follow a VAR model with restrictions. Thus, the conditional forecasts Xk,t+h at horizon h, h=1,…,H, are obtained as follows:

ˆXk,t+h = Ak ˆFk,t+h,
where:

ˆFk,t+h = ck + Φk ˆFk,t+h−1,  h = 1,…,H,  with ˆFk,t = Fk,t.
To summarize, we first estimate the latent factors and then use a VAR model on the factors to forecast both the factors and the variables of interest. The forecasts are then mapped back to the original scale of the variables by reversing the normalization and by integrating the series that were previously differenced.
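A hedged sketch of this forecasting step is given below: it fits a VAR to the extracted factors with statsmodels and maps the factor forecasts back to the variables through the loadings. The function name and the fixed lag order of one are our assumptions.

```python
# A minimal sketch of the forecasting step: VAR on the factors F (T x k),
# then Xk,t+h = Ak Fk,t+h with loadings Ak (n x k).
import numpy as np
from statsmodels.tsa.api import VAR

def forecast_variables(F: np.ndarray, Ak: np.ndarray, horizon: int = 12):
    res = VAR(F).fit(1)                                  # Fk,t = ck + Phi_k Fk,t-1 + eta_t
    F_hat = res.forecast(F[-res.k_ar:], steps=horizon)   # factor forecasts
    return F_hat @ Ak.T                                  # forecasts of the variables
```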
4. Empirical applications
4.1. Data description
We consider a dataset of macroeconomic variables related to the US and the EU economies, provided by Bloomberg. It consists of 42 monthly variables and 2 quarterly variables, sampled from December 2001 to January 2021, and includes some key variables for policy making: core and headline prices, labour market variables, imports, exports, industrial production, consumption, sales, leading indicators, interest rates, and the term structure. See Table 1 for a more detailed description.
In our dataset, the 2009 financial crisis and the COVID-19 pandemic generated outliers in many time series. For example, a graphical inspection of the US unemployment rate series reveals the dramatic impact of the COVID-19 pandemic after March 2020 (Figure 1). In the presence of outliers, the researcher can choose to trim the data, that is, to reduce or eliminate the outliers and run inference on a linear model, overcoming the inference issues (such as bias) that the outliers can generate. Data trimming requires that the outliers be detected first. To detect the presence of the outliers, a standard procedure consists in fitting the linear regression model:

Y = Xβ + u,
by Least Squares, where Y is the (T×1) vector of the dependent variable, X the (T×n) matrix of covariates, β the coefficient vector, and u the error term, and recovering the hat matrix H from the fitted value of Y:

ˆY = X(X′X)⁻¹X′Y = HY.
The hat matrix H is a symmetric and idempotent T×T projection matrix; it has n eigenvalues equal to one and T−n equal to zero. The diagonal elements ht,t satisfy the following properties:

0 ≤ ht,t ≤ 1, t = 1,…,T,  and  ∑t=1,…,T ht,t = n.
Points where ht,t takes large values are called leverage points, and it can be proved that the presence of leverage points signals observations that might have a decisive influence on the estimation of the regression parameters. We consider the leverage points as a proxy for a quick assessment of the presence of outliers. As Figure 2 shows, the largest values of ht,t are detected during the 2009 crisis and the COVID-19 pandemic.
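For illustration, the leverage diagnostic can be computed as follows (a minimal sketch; X is the T×n matrix of covariates and the function name is ours):

```python
# A minimal sketch of the leverage diagnostic: the diagonal of the hat
# matrix H = X(X'X)^(-1)X'.
import numpy as np

def leverage(X: np.ndarray) -> np.ndarray:
    G = np.linalg.solve(X.T @ X, X.T)   # (X'X)^(-1) X', avoiding explicit inversion
    h = np.einsum('ij,ji->i', X, G)     # diagonal elements h_tt of H
    return h                            # 0 <= h_tt <= 1 and sum(h) = n
```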
4.2. Factor Analysis
The factors have been extracted from the monthly variables using the three FM methodologies (FM Std, FM FMCD, and FM IRLS) and then used to produce forecasts with a VAR model, as described in Section 3.4. As regards the quarterly variables, we follow a nowcasting procedure: first, we derive the regression coefficients of the quarterly variables on the nowcasted factors; second, we use these coefficients and the forecasted factors to forecast the quarterly variables.
We analyze the stability of the factors and the percentage of explained variance. We follow a rolling window estimation approach and analyze the out-of-sample forecasting ability of the FMs over a twelve-month horizon. There are 61 overlapping windows of 170 observations each. The first window runs from December 2001 to January 2016; the second is shifted by one month, from January 2002 to February 2016; and the 61st runs from December 2006 to January 2021. See Figure 3 for a graphical illustration of the procedure.
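The window scheme can be sketched as follows; the date handling follows the description above, and the loop body is a placeholder for the estimation and forecasting steps.

```python
# A small sketch of the rolling-window scheme: 230 monthly dates from
# December 2001 to January 2021 yield 61 overlapping windows of 170
# observations.
import pandas as pd

dates = pd.date_range("2001-12-01", "2021-01-01", freq="MS")   # 230 months
window = 170
n_windows = len(dates) - window + 1                            # 61 windows
for w in range(n_windows):
    in_sample = dates[w : w + window]     # estimation window w + 1
    # ... extract factors on in_sample and forecast the next 12 months ...
```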
In our empirical applications, the k factors are used to forecast the variables of interest, which are the Unemployment rate and the Harmonized Index of Consumer Prices (HICP); with the nowcasting procedure we also produce forecasts for GDP. Our choice is to explain a given proportion of variance Vk<1 in Eq. (6) with a reduced number of factors, in order to limit the dimension of the forecasting model (i.e., the VAR model). For example, in our application we choose to explain at least 80% of the variance, Vk=0.8, with no more than 9 factors.
Figure 4 reports the values of the leverage points ht,t estimated from the panel series in some relevant windows (see plot labels). The value of ht,t increases slowly in the observation windows that include the 2009 crisis (e.g., see plots w1, w19, w31, and w43). When the observations associated with the COVID-19 pandemic period are included in the samples, ht,t reaches much larger values, close to 1 (see w51, w52, w53, and w58). A double peak appears in window w61 due to the second wave of the COVID-19 pandemic.
In conclusion, we consider COVID-19 the largest source of outliers that researchers are currently facing. For this reason, we choose to retain a fraction r = 0.95T of the observations in the FMCD algorithm.
Figure 5 shows the eigenvalues (left column) and the contribution of the first 9 factors (right) to the variance for the three FMs. The graphs refer to the data of the last window in Figure 3. The scale of the eigenvalues differs across models, since the weights used in FMCD and IRLS have different sizes. The decay rate of the spectrum is similar across models, indicating that a small number of factors explains a large proportion of the variance. The FM FMCD and FM IRLS models capture a smaller proportion of variance than the FM Std.
The green lines in Figure 6 show the weights used in the FMCD (left) and IRLS (right). Setting r = 0.95T yields weights equal to one for all observations except those in the COVID-19 pandemic windows, where the weight is equal to zero. For the IRLS, the weights are strictly positive and below one for all estimation windows. The two weight sequences have a different impact on the extraction of the factors (e.g., see the first factor in the same figure and the three factors in Figure 7).
The FM Std factors exhibit at least two peaks, corresponding to the 2009 crisis and the COVID-19 pandemic windows. In the robust FM procedures, the weight sequences substantially reduce the effects of the two sources of outliers.
In Table 2, we illustrate the bias induced by the presence of outliers by comparing the correlations between the variables in the dataset (columns) and the first factor of the three models (rows), estimated in the last window (w61). The first factor explains 26%, 23%, and 20% of the variance in the three models, respectively. The set of the most correlated variables in the standard FM differs from that of the FM FMCD and FM IRLS models, which indicates that the bias in the estimated factors can be large in FM models if the outliers are not treated properly.
FM FMCD and FM IRLS share 9 common variables, and the correlation levels are similar in the two models; this result indicates that the choice of the weights can have an impact on the results, but does not substantially affect the economic interpretation of the factors.
4.3. Forecast comparison
We use the rolling window analysis introduced in the previous section to compare the three models: FM Std, FM FMCD, and FM IRLS. For each window, the models produce 12 out-of-sample forecasts (see Figure 3). We measure and compare sequentially the ability of the models to forecast the following variables: GDP, the Unemployment rate, and PCE, for both the EU and the US.
For every window, we extract the factors and compute the forecasts at a horizon of 12 months for the monthly variables and 4 quarters for the quarterly variables. The rolling window of 170 observations is then moved forward by one month, and the forecasts are computed again. We repeat this exercise 61 times, until the end date of the observation window coincides with January 2021.
For every series, we compute the squared differences between the forecasts and the actual values, sum them, divide by the total number of forecast points, and take the square root to obtain the Root Mean Square Error (RMSE). Let st be the forecast horizon at time t for monthly data; in our application it is equal to 12 for all t except when the end of the window is close to January 2021, in which case the horizon decreases. Moreover, let TEp = 61 be the number of forecasts for each one of the st months.
At time t, we have the following error for every forecast (we omit here the identification of the variable):

εt,i = v(t+i) − f(t+i),  i = 1,…,st,
where f(t+i) indicates the forecast of the variable value v(t+i), made at time t with forecasting horizon i. The RMSE at horizon i is thus defined as follows:

RMSE(i) = ((1/TEp)∑t εt,i²)^(1/2),

where the sum runs over the forecast exercises for which horizon i is available.
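A minimal sketch of this computation follows; the array names and the NaN convention for unavailable horizons are our own assumptions.

```python
# forecasts and actuals are hypothetical arrays of shape (T_Ep, s): one
# row per forecast exercise, one column per horizon i, with NaN where the
# horizon falls beyond January 2021.
import numpy as np

def rmse_by_horizon(forecasts: np.ndarray, actuals: np.ndarray) -> np.ndarray:
    sq_err = (forecasts - actuals) ** 2
    # nanmean averages only over the available horizons for each column
    return np.sqrt(np.nanmean(sq_err, axis=0))   # one RMSE per horizon i
```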
As a first step, we show the RMSE values for the first three factors, which capture more than 52.5%, 47.8%, and 46.7% of the total variance for FM Std, FM FMCD, and FM IRLS, respectively.
The left column of Figure 8 shows the actual values of the three factors (solid blue lines, one per row), the 12-step-ahead FM forecasts (dashed lines), and their envelope (solid red lines), which can be considered an approximation of the forecast error bands. The forecast comparison includes the COVID-19 pandemic period but cannot be made for the 2009 crisis, due to the choice of the rolling window size (see Figure 3). Thus, in the following we focus on the forecasting ability during the COVID-19 period.
For the first factor, the actual values belong to the envelope region for all periods except the pandemic crisis periods, which reveals the difficulty of predicting the effects of the pandemic events. A similar behavior can be detected for the second factor. The middle and right columns in Figure 8 show the first three factors for the FMCD and IRLS methodologies, respectively; their behavior is comparable only from a graphical point of view, because the two algorithms produce factors on different scales. Figure 9 shows the forecasts and the RMSEs for our variables of interest and for the three methodologies: FM Std (left column), FM FMCD (middle), and FM IRLS (right).
Using the envelope (solid red lines) as a reference, it is possible to compare graphically the forecast performance of the models. Since the actual data fall within the area delimited by the envelope for the FM FMCD and FM IRLS models, we conclude that they usually perform better than FM Std. The lower RMSE levels of the FM FMCD and FM IRLS models for both the monthly and quarterly variables confirm this result (see panel (a) of Table 3).
The effect of outliers on the most impacted variables propagates to the forecasts of the other variables through the factors, which can explain the poor performance of the standard FM. The variables of interest that are the most difficult to predict are GDP and the Unemployment rate, whose values were the most affected by the crisis. On the other hand, prices maintain good predictability in both regions, because this variable was not heavily penalized by the crisis. The impact of outliers is reduced in the FM FMCD and FM IRLS models; nevertheless, for the US Unemployment rate, the effects of COVID-19 on the forecasting performance are disruptive for all three methodologies.
In Table 3(b), we can see that the RMSE of the 12-month-ahead forecast for the US Unemployment rate is smaller than that of the forecasts at 5 and 7 months ahead. This is mainly due to the magnitude of the error of the forecast made in April 2020, which includes in its horizon the first sample impacted by COVID-19 and therefore has a very large forecast error. Since the dataset ends in January 2021, the forecast horizon st in this exercise is 9 months (see Figure 3), which implies that for this forecast it is possible to measure the errors at 1, 5, and 7 months ahead, but not at 12.
Finally, following the guidelines provided by Eurostat (2020) on modelling outliers due to COVID-19, we monitor the forecast errors sequentially. The RMSEs of the one-, two-, seven-, and twelve-step-ahead forecasts of the three methodologies indicate that the FM FMCD and FM IRLS models perform better than the FM Std model at all horizons (see Figures 10, 11, and 12 in the Appendix). The numerical results in panel (b) of Table 3 suggest that the FM IRLS model has superior forecasting ability at all horizons.
The bottom line from this section is the following:
● the sample observations during the 2009 crisis and the 2020 COVID-19 pandemic heavily affect factor estimates obtained with the standard procedure;
● consequently, standard factor models can produce significant forecasting errors in the presence of outliers, whereas robust models perform better;
● the variables most impacted by the 2009 crisis and the pandemic (such as GDP and unemployment) exhibit the most significant forecast errors in all estimation procedures;
● the sequential forecast comparison between MCD and IRLS showed that the latter approach usually leads to superior forecasting performance.
5. Conclusions
Outliers can have disruptive effects on inference, biasing the estimates and the conclusions of the statistical analysis. Through the lens of factor models, we provide evidence of the effects of the outliers due to the 2009 crisis and the COVID-19 pandemic on the forecasting ability of the models. We applied two techniques for robust factor estimation based on robust covariance matrix estimators; the robust methodologies we chose have the advantage of avoiding data deletion or manipulation. We compared the standard factor estimation with the robust estimation approaches over an extended period and on a set of relevant variables. The choice to include the COVID-19 pandemic period in the estimation and forecasting exercises aims to highlight the relevance of handling outliers in periods of large shocks to the world's economies. We show that robust estimation can reduce the influence of outliers and produce good forecasts.
Conflict of interest
All authors declare no conflicts of interest in this paper.