
We propose and study computationally a novel non-local multiscale moving boundary mathematical model for tumour and oncolytic virus (OV) interactions when we consider the go or grow hypothesis for cancer dynamics. This spatio-temporal model focuses on two cancer cell phenotypes that can be infected with the OV or remain uninfected, and which can either move in response to the extracellular-matrix (ECM) density or proliferate. The interactions between cancer cells, those among cancer cells and ECM, and those among cells and OV occur at the macroscale. At the micro-scale, we focus on the interactions between cells and matrix degrading enzymes (MDEs) that impact the movement of tumour boundary. With the help of this multiscale model we explore the impact on tumour invasion patterns of two different assumptions that we consider in regard to cell-cell and cell-matrix interactions. In particular we investigate model dynamics when we assume that cancer cell fluxes are the result of local advection in response to the density of extracellular matrix (ECM), or of non-local advection in response to cell-ECM adhesion. We also investigate the role of the transition rates between mainly-moving and mainly-growing cancer cell sub-populations, as well as the role of virus infection rate and virus replication rate on the overall tumour dynamics.
Citation: Abdulhamed Alsisi, Raluca Eftimie, Dumitru Trucu. Non-local multiscale approach for the impact of go or grow hypothesis on tumour-viruses interactions[J]. Mathematical Biosciences and Engineering, 2021, 18(5): 5252-5284. doi: 10.3934/mbe.2021267
[1] | Norman Maswanganyi, Edmore Ranganai, Caston Sigauke . Long-term peak electricity demand forecasting in South Africa: A quantile regression averaging approach. AIMS Energy, 2019, 7(6): 857-882. doi: 10.3934/energy.2019.6.857 |
[2] | Marwa M. Ibrahim, Amr A. Elfeky, Amal El Berry . Forecasting energy production of a PV system connected by using NARX neural network model. AIMS Energy, 2024, 12(5): 968-983. doi: 10.3934/energy.2024045 |
[3] | Hassan Shirzeh, Fazel Naghdy, Philip Ciufo, Montserrat Ros . Stochastic energy balancing in substation energy management. AIMS Energy, 2015, 3(4): 810-837. doi: 10.3934/energy.2015.4.810 |
[4] | Mohamed Elweddad, Muhammet Güneşer, Ziyodulla Yusupov . Designing an energy management system for household consumptions with an off-grid hybrid power system. AIMS Energy, 2022, 10(4): 801-830. doi: 10.3934/energy.2022036 |
[5] | Kevin R Caffrey, Mari S Chinn, Matthew W Veal . Biomass supply chain management in North Carolina (part 1): predictive model for cropland conversion to biomass feedstocks. AIMS Energy, 2016, 4(2): 256-279. doi: 10.3934/energy.2016.2.256 |
[6] | Mulualem T. Yeshalem, Baseem Khan . Design of an off-grid hybrid PV/wind power system for remote mobile base station: A case study. AIMS Energy, 2017, 5(1): 96-112. doi: 10.3934/energy.2017.1.96 |
[7] | Bilal Akbar, Khuram Pervez Amber, Anila Kousar, Muhammad Waqar Aslam, Muhammad Anser Bashir, Muhammad Sajid Khan . Data-driven predictive models for daily electricity consumption of academic buildings. AIMS Energy, 2020, 8(5): 783-801. doi: 10.3934/energy.2020.5.783 |
[8] | Akram Jahanshahi, Dina Jahanianfard, Amid Mostafaie, Mohammadreza Kamali . An Auto Regressive Integrated Moving Average (ARIMA) Model for prediction of energy consumption by household sector in Euro area. AIMS Energy, 2019, 7(2): 151-164. doi: 10.3934/energy.2019.2.151 |
[9] | Sameer Thakare, Neeraj Dhanraj Bokde, Andrés E. Feijóo-Lorenzo . Forecasting different dimensions of liquidity in the intraday electricity markets: A review. AIMS Energy, 2023, 11(5): 918-959. doi: 10.3934/energy.2023044 |
[10] | Mehrdad Heidari, Alireza Soleimani, Maciej Dzikuć, Mehran Heidari, Sayed Hamid Hosseini Dolatabadi, Piotr Kuryło, Baseem Khan . Exploring synergistic ecological and economic energy solutions for low-urbanized areas through simulation-based analysis. AIMS Energy, 2024, 12(1): 119-151. doi: 10.3934/energy.2024006 |
We propose and study computationally a novel non-local multiscale moving boundary mathematical model for tumour and oncolytic virus (OV) interactions when we consider the go or grow hypothesis for cancer dynamics. This spatio-temporal model focuses on two cancer cell phenotypes that can be infected with the OV or remain uninfected, and which can either move in response to the extracellular-matrix (ECM) density or proliferate. The interactions between cancer cells, those among cancer cells and ECM, and those among cells and OV occur at the macroscale. At the micro-scale, we focus on the interactions between cells and matrix degrading enzymes (MDEs) that impact the movement of tumour boundary. With the help of this multiscale model we explore the impact on tumour invasion patterns of two different assumptions that we consider in regard to cell-cell and cell-matrix interactions. In particular we investigate model dynamics when we assume that cancer cell fluxes are the result of local advection in response to the density of extracellular matrix (ECM), or of non-local advection in response to cell-ECM adhesion. We also investigate the role of the transition rates between mainly-moving and mainly-growing cancer cell sub-populations, as well as the role of virus infection rate and virus replication rate on the overall tumour dynamics.
The monthly, quarterly and annual time series data display seasonality, fluctuation, nonlinearities, and so on. The total electricity production data often has some characteristics, such as seasonal patterns, trends and nonlinearity. To overcome the issue and obtain more accurate forecasts, the various models have been applied in the past several decades. Traditionally, the univariate time series models are often included in forecasting comparisons: the sARIMA model and the HW model. The main advantage of using univariate time series models is that they do not consider the relationship between dependent and independent variables, but they are used to predict the future values of a time series by using only historical data. One of the most important issues in model selection is to determine the appropriate approach to utilize forecasting. The sARIMA and the HW models are widely used forecasting methods which can capture the seasonality. For the sARIMA and the HW models, the parameters representing the differential or seasonal need to be estimated. The popularity of the sARIMA and the HW models is not only due to their success in forecasting, but also their simplicity and analytical ability. However, empirical findings demonstrate that the sARIMA and the HW models have a drawback. For example, both models only assume the linear form of the model. The nonlinear patterns cannot be captured by the sARIMA and the HW models. On the other hand, the nonlinear models, such as the ANN and the ANFIS, can only capture nonlinearity in data. For this reason, the nonlinear models are not able to model the linear part in data. Due to the shortcomings of linear and nonlinear models, the hybrid forecasting approach has come to the forefront over the past decade.
The motivation of this study is based on the following remarks:
− The study by Makridakis et al. [1] manifests that the complex models usually fit past data well, but forecasting the future, the simple models are often more accurate;
− Khashei and Bijari [2] put forward an approach to encourage motivation for combining different single methods to benefit from their strength of capturing different characteristics of the time series data.
Thus, the aim of this study is to forecast the total electricity production in South Africa under two different types of approaches. The first approach assumes that simple forecasting models may perform better than the models of complex form. The second approach assumes that the hybrid models may have better out-of-sample forecasting performances than simple models. The main reason for using hybrid models is that forecasting accuracy can be enhanced by using the linear and the non-linear features.
The forecasting models found in the literature are divided in two main frames: simple models and complex models. For this study, the focus is on the forecasts by sARIMA and Holt-Winter's as simple models and by hybrid models, such as the ANN based sARIMA, the ANN based HW models, the ANFIS based sARIMA and the ANFIS based HW models. The ARIMA model is one of the most optimal forecasting models in the literature. Besides being simple and useful, the Holt-Winters method and its versions, 'additive' and 'multiplicative', are significantly robust for a wide range of energy applications. Bianchi et al. [3] provide the evidence that the Holt-Winters method performs as well as or better than more complex methods. De Gooijer and Hyndman [4] claim that the ARIMA is a robust method to deal with trend and seasonality. Armstrong [5] reports that the ARIMA model improve forecasting accuracy but there is a little evidence that supports this claim. Chen et al. [6] find that the sARIMA is the most appropriate model for forecasting inbound air travel arrivals to Taiwan. Gelper et al. [7] put forward that the Holt-Winters method is a useful method if data shows trend and seasonality. Permanasari et al. [8] indicate that the ARIMA model has better forecasting performance than the Holt-Winters method by using disease incidence, especially for the seasonal disease. Omane-Adjepong et al. [9] state that the sARIMA model yields more accurate forecasts than the seasonal additive and multiplicative Holt-Winters methods. Rahman and Ahmar [10] show that the Holt-Winters additive type is better than the ARIMA model for the total primary energy consumption data. de Oliveira and Oliveira [11] use the combination of decomposition and bootstrap agammaegating techniques to improve forecasts based on the univariate models, such as the sARIMA model, the HW additive and multiplicative models and the ETS model for electric energy demand across different countries. González et al. [12] employ the Hilbertian autoregressive moving aveage (ARMX) model to forecast electricity price. In terms of the linearity, much attention has been directed in recent literature to the linear quantile regression to model electricity demand. Lebotsa et al. [13] employ partially linear additive quantile regression to forecast the short-term electricity demand.
The study, by Darbellay and Slama [14], discussed that the ANN model can be useful for nonlinear process. Chatfield [15,16] questioned whether the concerns with the ANN model were exaggerated regarding the perfect forecasting technique. There is no doubt that Chatfield was not the only one in awe of the forecasting performance of ANN model. This was followed by several papers discussing that naïve models' forecasting performance, such as random walk, can beat the ANNs [17,18,19,20,21]. The authors of [2,22,23] show that ANNs produce promising results compared to the traditional time series models, such as the ARIMA model. Besides the ANN model, there is a large body of literature to attest to the fact that grey prediction model has significant impact on electricity consumption forecasting. Ding et al. [24] use a new grey model to forecast China's electricity consumption. Hu [25] forecasts electricity consumption by using the ANN based grey forecasting method. In terms of the comparison between the ANFIS model and the ARIMA model, there exist various results in the literature. Tektaş [26] show that the ANFIS model has better results than the ARIMA model in terms of forecasting performance. Yayar et al. [27] conclude that the ANFIS model is more appropriate than the ARIMA in point of the forecasting of electric consumption. Yadav and Balakrishnan [28] find that the ANFIS model performs better than the ARIMA model. However, Hernandez et al. [29] argue that the ARIMA model is better than the ANFIS model in terms of forecasting. Lusis et al. [30] in their study state that the support vector regression model (SVR) has higher accuracy for a day-ahead load forecast. Luo et al. [31] show that even if the SVR model is most robust rather than the multiple linear regression model and the ANN model, all models do not satisfy forecasting when the scale of the data integrity attacks becomes large. Li et al. [32] suggest that the combination of ensemble empirical mode decomposition (EEMD) and random forest model to forecast daily electricity consumption shows the best forecasting performance, in comparison to others, such as a back-propagation neural network (BPNN) and least square support vector machine (LSSVM). Chen et al. [33] suggest that their proposed method based on the combination of EMD and extreme learning machine (ELM) provides better forecasting performance than all other three methods, such as radial basis function kernel (RBF) based ELM, universal kernel function (UKF) based ELM and mixed-ELM, in terms of electric load forecasting.
There is known fact that the series generated from linear process might be inappropriate for most real-world problems that are non-linear [34,35]. As seen in many studies, the literature holds different views in respect to the forecasting performances of linear and non-linear models. A time series usually consists of the linear and the nonlinear parts. Herein, one particular type of model that has come into prominence is the hybrid model. The hybrid model is a forecasting technique that combined other individual models. In this study, the ARIMA and Holt-Winters models for the linear part of a time series data were used, and the ANN and the ANFIS models are used to handle the non-linear part that corresponds to error term for a time series data. The process for the hybrid models used in this study was presented in Figure 1.
As seen in the literature above, there are widely studied the linear and the nonlinear models to forecast different data in energy sector. These models can be either univariate or multivariate models (such as multivariate linear quantile regression for linear model and grey forecasting approach for nonlinear model). The reason of focusing on the univariate models is that the univariate models outperform than the multivariate models in terms of out-of-sample forecasting [36]. Moreover, the study by Kunst [36] explains the possible reasons why the multivariate models have not been used in this study: 1) it is clearly seen that the number of parameter to be estimated in multivariate model is more than the univariate ones. Each parameter to be added to the model, which refers to unknown quantity, results in additional source of error; 2) in terms of the multivariate models, there are not only more model structures than univariate ones (which will cause model selection problem), but also there are decision-making issues which need to be faced according to the selection of independent variables which have impact on the dependent or the output variables (for example, in the context of technical economic analysis, the various inputs need to be used to mathematical modeling of the combined methanol-electricity production plant [37]). However, in the light of the forecasting process, the inaccurate selection of the independent variables will increase the noise, and this will lead to the poor forecasts.
The rest of the paper is organized as follows: In Section 2, the descriptive statistics of the dataset was briefly introduced. Section 3 of the paper provides necessary description of the ARIMA, the Holt-Winters, the ANN and the ANFIS models. The forecasted results are presented in Section 4. Section 5 contains conclusion.
In this study, the quarterly total electricity production in South Africa for the period from January 1985 (1985:Q1) to September 2017 (2017:Q3) were used. The data are provided from the internet page of Federal Reserve Economic Data. In addition to the data, R program is used to forecast total electricity production. Figure 2 shows the time-series plot of the data.
According to Figure 2, the total electricity production decreases in December. In South Africa, the summers last from November to January, and the summers are hot and generally dry. According to World Meteorological Organization [38], January 2013 was the hottest month. The electricity production indeed declines in dry, hot months if river flows ebb [39]. Because South Africa is a water scarce country and has limited water storage, the total electricity production forecasting is important for both future planning and policy [40]. Since the data shows the seasonal pattern, the sARIMA and the HW models are applied to capture the seasonality. Figure 3-a allows the underlying seasonal pattern to be seen more clearly. According to Figure 3-a, the total electricity production has consistently increased over the years as the lower (darker) lines represent earlier years, and the higher (lighter) lines represent recent years. Moreover, Figure 3-b shows the changes in seasonality over time. Table 1 shows the descriptive statistics of the data and confirms the left skewness and platykurtic distribution. The data with low kurtosis, the less than number three, tends to have light tails, or lack of outliers. This finding supports our aim which is to hold all data points in the model. Otherwise, the outliers in dataset need to be removed, and it is known fact that they can have significant adverse impact on our forecasts.
Mean | Maximum | Minimum | Std. Dev. | Skewness | Kurtosis |
17800.330 | 22868.000 | 11073.000 | 3442.301 | −0.287 | 1.676 |
LB-Q (12) | p-value | J-B Test | p-value | ARCH (12) | p-value |
1207.133 | 0.000 | 11.36815 | 0.0034 | 3.687 | 0.978 |
As a comparison, the HW model by Holt [41] and Winters [42], the sARIMA model by Box and Jenkins [43], the ANN model by Rumelhart et al. [44] and the ANFIS model by Jang [45] were applied to the total electricity production. In this study, the HW model was used because it is not only a widely used tool for forecasting method which can handle trend and seasonal variation, but also outperforms the sophisticated ones according to Makridakis and Hibon [46]. On the other hand, the forecasting competition between simple and hybrid models has attracted considerable academic attention. In recent years the ANN and the ANFIS models have shown better modeling performance than the other nonlinear models when dealing with modeling the nonlinear patterns in data. Moreover, the ANN and the ANFIS models have their popularity in terms of their applications and are also well embedded in the literature. Therefore, the ANN based on sARIMA, the ANN based on HW models, the ANFIS based on sARIMA model and the ANFIS based on HW models in forecasting the total electricity production have been used to allow for fair competing in the study. In this part, the basic framework of modelling approaches of the sARIMA model, the HW model, the ANN model and the ANFIS model are briefly described. To better understand the methodological baseline, the two utmost important criteria are considered to guide methodology selection. First, the methodology selection is based on the forecasting related literature. The second criteria to the methodology selection is based on the data characteristics.
Autoregressive Moving Average Model (ARMA) has become considerably important tool in forecasting of economic time series. The autoregressive part is called as Autoregressive model (AR), and the moving average part is called as Moving Average (MA) model. In AR model, the output variable is a linear function of the previous behaviors of itself. In MA model, the output variable is a linear function of the current and various past values of a stochastic term. In ARMA (p, q) model, p and q are the orders of AR and MA models respectively. The ARMA (p, q) model is defined as follows [47]:
yt=p∑i=1Φiyt−i+et+q∑i=1θiet−i | (1) |
If the observed data is non-stationary, taking differencing (d) can be used to eliminate non-stationarity. The model is then referred as an ARIMA model. The ARIMA (p, d, q) model is given as follows [47]:
(1−Φ1B−Φ2B2−⋯−ΦpBp)(1−B)dyt=c+(1−θ1B−θ2B2−⋯−θqBq)et | (2) |
where yt denote a series of values for a time series of interest, and et denotes a series of random disturbances and is assumed to be white noise. Let be B the backward shift (or lag) operator.
In this study, the sARIMA model which includes additional seasonal terms in the ARIMA model was used because the quarterly total electricity production data was used as output variable. The sARIMA (p, d, q) (P, D, Q) model can be written as [48]:
(1−Φ1B−⋯−ΦpBp)ΦP(BS)(1−B)d(1−BS)yt=(1−θ1B−⋯−θqBq)θQ(BS)et | (3) |
where p is non-seasonal AR order; d is non-seasonal differencing; q is non-seasonal MA order; P is seasonal AR order; D is seasonal differencing; Q is seasonal MA order; and S is time span of repeating seasonal pattern.
The HW model, which can deal with a time series containing both trend and seasonality, is an extension of the Exponential Smoothing (ES) model. The HW model has two versions, additive and multiplicative. In this study, both additive and multiplicative versions of HW exponential smoothing model have been applied in the forecast of total electricity production. The general forecast functions for the additive and multiplicative HW models are as follows [49].
Additive formulation of HW:
ℓt=α(yt−st−m)+(1−α)(ℓt−1+bt−1)bt=β(ℓt−ℓt−1)+(1−β)bt−1st=γ(yt−1−ℓt−1−bt−1)+(1−γ)st−mˆyt+h|t=ℓt+bth+st+h−m | (4) |
Multiplicative formulation of HW:
ℓt=α(yt/st−m)+(1−α)(ℓt−1+bt−1)bt=β(ℓt−ℓt−1)+(1−β)bt−1st=γ(yt−1/(ℓt−1−bt−1))+(1−γ)st−mˆyt+h|t=(ℓt+bth)st+h−m | (5) |
where ℓt is the component of level, bt is the component of the slope, and st+h−m is the relevant seasonal component. The crude oil prices are denoted by y1,…,yn and the seasonal period is m (e.g., m = 12 for monthly data). Let ˆyt+h|t be the h-step forecast made using data to time t.
The most widely used ANNs in the forecasting field are multi-layer perceptrons (MLPs) in literature [50,51]. The ANN model can be divided into four stages: (1) the topology design stage consist of the choice of the ANN type, the number of its layers, the number of neurons in each layer, its inputs and outputs, the selection of training, validation and test samples; (2) in the training stage, the neural network learns the potential relationship between input variables and output variable, and the learning process is continued till it finds the minimum error; (3) in the validation stage, connection weights are adjusted with unknown data; and (4) the test stage is applied to evaluate the performance of the net with the test sample. Figure 5 shows the ANN architecture with two hidden layer nodes. In the ANN framework, while the total electricity production was considered as output variable, the all other explanatory variables that are the lagged values of the output variable were included in the model as input variables. Specifically, the determination of the number of hidden layers are very important. The theoretical findings show that there is no reason to use more than two hidden layers in the ANN model [52]. Moreover, Kurkova [53] provides the evidence that a feedforward neural network with two hidden layers should be used to countervail the lost efficiency in the usage of regular activation function. Nevertheless, the other reason of using two hidden layers is to severely reduce the total number of necessary hidden nodes. The final ANN model is four layered network which consist of lagged output units as inputs (from one-time lagged to four-time lagged), two hidden layers, and one output unit. The ANN (I × H1 × H2 × O) notations were used to represent the number of input variables (I), hidden units in each layer (H1and H2), and the output unit (O).
This method based on the theory of fuzzy set and fuzzy logic is proposed by Jang [45]. This method is composed of ANN system and FIS system. The ANN model is known as statistical data modelling tool. It refers to learning algorithm which can capture the complex patterns in the relationship between input and output data. The FIS comprises membership function, fuzzy logic operator and if-then rules. Figure 6 shows the ANFIS architecture. In addition to the other models, the ANFIS model which takes the advantages of using neural networks and fuzzy logic at the same time, was carried out to check whether it has the best performances among the six implemented models. In this study, genfis3 function of Fuzzy toolbox of MATLAB was performed to generate a FIS using fuzzy c-means clustering (FCM) by extracting a set of rules that models the data behavior. The function requires separate sets of input and output data as input arguments. the lagged values of total electricity production series were used as input variables.
For time series forecasting, two types hybrid models are employed as alternatives to widely used the ARIMA and the HW models. Recently, combined models with both linear and non-linear models have been attracted much attention. Zhang [54] used hybrid ARIMA-ANN model to handle both linear and non-linear parts of the data. The linear part was modelled by ARIMA, and the non-linear part was modelled by the ANN model. In this study, the same process detailed as below was followed.
Tt=Lt+Nt | (6) |
where Lt is the linear component and Nt is the non-linear component. Firstly, the sARIMA and the HW models for the linear component were used and then the non-linear relationship will be appeared in the residual of linear component modelling.
εt=Tt−ˆLt | (7) |
The non-linear function by ANN and ANFIS models are described as follows:
εt=f(εt−1,εt−2,εt−3,⋯,εt−n)+ξt | (8) |
where f is non-linear function and ξt is a random error. The Equation (6) can be re-written for the hybrid model as follows:
ˆTt=ˆLt+ˆNt | (9) |
In Equation (9), ˆNtshows the forecasted value corresponding to εt from Equation (8). To sum up, the following steps for the hybrid methodology can be written. The utilized neural network algorithm used in hybrid methodology is presented in Table 2.
Hybrid Methodology |
1. Given a time series x = [x1, x2, …, xn]. 2. Determine the training set (82%) and validation test (9%) size as in-sample, and test size (9%) as out-of-sample. 3. Normalize the time series data using min-max normalization using the following formula: zi = (xi-min(x))/(max(x)-min(x)) 4. Determine the best simple individual models and their parameters using the normalized vector in-sample period. 5. Obtain the forecasts using selected simple individual models. 6. De-normalize the forecasts to find ^Lt in Equation (7). 7. Obtain residuals (εt) by subtracting simple models' forecasts from actual values in Equation (7). 8. Select the best lag number to determine the number of inputs on the data of residual series. 9. Normalize the residual series using min-max normalization. 10. Obtain forecasts using ANN and ANFIS models. 11. De-normalize the forecasts to obtain ^Nt from Equation (8). 12. Combine the simple models' forecasts with ANN and ANFIS models' forecasts as in Equation (9). |
input: εi,j:the errors obtained by subtracting jth model predictions from actual values, ζm: model building period (training period), ζv: validation period, ζt: test period, i: the number of observed predictions, j: the number of univariate models, h1(thenumberofnodesinfirsthiddenlayer)=c(1,2,…,12), h2(thenumberofnodesinsecondhiddenlayer)=c(1,2,…,12), k: the time delay number of εi,j. |
1: Given εi,j, ζm, ζv 2: foreach j do foreach k do create datamatrix =cbind(Lag((εi,j,k=1,..4), εi,j) end for end for 3: Run neural network model {εi,j;ζm,h1,h2,ε′i,j;ζm} 4: Compute MAEζv,ε′i,j,h1,h2 5: Save minimum MAEζv,ε′i,j,h1,h2 to determine the optimum number of nodes in each hidden layer 6: Run neural network model {εi,j;ζm+ζv,h1,h2,ε′i,j;ζm} output: Optimal model − optimal ε′i,j with optimum h1 and h2 |
The data are divided into two parts, i.e. training and testing datasets. The total electricity production data from 1985:Q1 to 2014:Q3 are used as training datasets, and the data from 2014:Q4 to 2017:Q3 are used as testing data sets. The different parameters of p, d, q, P, D, Q and S were experimented in order to determine the best model that will give the best forecast. The KPSS unit root test and AIC information criteria were used to determine the degree of differencing d and the appropriate seasonal orders for the purpose of choosing the best sARIMA model. In the light of the findings, sARIMA (1, 0, 0) (0, 1, 1) [4] is considered the best for total electricity production by using R software which supports the automatic search on determining the parameters.
The HW model is a triple exponential smoothing forecast method that has the capability to handle trend and seasonality effectively. It is a five-step process in which practitioners must calculate the specific factors at each step (1) seasonal indices; (2) overall smoothing of trend level; (3) trend factor; (4) smoothing of seasonal indices; and (5) generating of forecasts. Both multiplicative HW and additive HW models are used.
The ANN model can be divided into four stages: (1) the topology design stage consist of the choice of the ANN type, the number of its layers, the number of neurons in each layer, its inputs and outputs, the selection of training, validation and test samples; (2) in the training stage, the neural network learns the potential relationship between input variables and output variable, and the learning process is continued till it finds the minimum error; (3) in the validation stage, connection weights are adjusted with unknown data; and (4) the test stage is applied to evaluate the performance of the net with the test sample. In this study, 107 observations for training set, 12 observations for validation set and 12 observations for testing set were used. The training dataset are used to learn or develop candidate models, validation set used to select a model and test set is used for assessing model performance on future data. Table 3 shows the validation set performance of the ANN based on sARIMA.
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 4 | 262.138 |
2 | 2 | 9 | 261.9352 |
3 | 5 | 2 | 233.5972 |
4 | 6 | 4 | 243.0433 |
As results reported in Table 3 show that one to four lagged of total electricity production are used as input variables. The ANN (I × H1 × H2 × O) notations were used to represent the number of input variables (I), hidden units in each layer (H1 and H2), and the output unit (O). Finally, ANN (3 × 5 × 2 × 1) model used to forecast the non-linear part of the data.
The validation set performance of ANN based on multiplicative HW model is given in Table 4. Based on the results of Table 4, one to four lagged total electricity production are chosen as input variables. ANN (3 × 8 × 9 × 1) model used to forecast the non-linear part of the data.
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 242.2049 |
2 | 4 | 8 | 245.0679 |
3 | 8 | 9 | 230.3607 |
4 | 1 | 7 | 233.0093 |
The validation set performance of ANN based on multiplicative HW model is given in Table 5. According to results of Table 5, it is evident that the number of input variables is three and the best model is chosen as ANN (3 × 8 × 3 × 1) model used to forecast the non-linear part of the data.
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
In regard to ANFIS model, it is applied by using the MATLAB program. In this study, one factor – the lagged values of total electricity production itself –was used as inputs for forecasting the total electricity production. The first 117 of data is used as training set to optimize the model parameters, 12 of data is used as validation set and the last 12 serves as the test set. The used steps are as follows:
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
First, in order to determine that the number of input variables and the number of clusters for fuzzy c-means clustering (FCM), the ANFIS based sARIMA model has been conducted in terms of validation sets. The selection is made by minimum root mean square performance metrics (RMSE). Figure 7 shows the best model with three input variables (including 1, 2 and 3 lagged total electricity production, respectively) and FCM is equal to three.
The same process is applied to the ANFIS based on multiplicative HW model and the ANFIS based on additive HW model to find out whether the hybrid models perform better than simple linear models. Figure 8 and Figure 9 show the results of the ANFIS based on multiplicative HW model and the ANFIS based on additive HW model, respectively. The best model for the ANFIS based on multiplicative HW model is selected with minimum RMSE that is the one with one input variable (one lagged total electricity production) and three numbers of clusters. The validation set results for the ANFIS based on additive HW model show that three input variables (including 1, 2 and 3 lagged total electricity production respectively) and five numbers of clusters are selected according to RMSE measure.
After forecasting the linear and non-linear parts of the data, the forecasts by hybrid models are calculated. Table 6 shows the out-of-sample forecasting of all models involved in this study. In order to evaluate the model performances, the following three indices, RMSE, the mean absolute error (MAE) and mean absolute scale error (MASE) were applied in this study. The RMSE, MAE and MASE are defined by following equation:
RMSE=√∑ni=1(Ai−Fi)2n | (10) |
MAE=∑ni=1|Ai−Fi|n | (11) |
MASE=∑ni=1|Ai−Fi|nn−m∑ni=m+1|Ai−Ai−m| | (12) |
Models | RMSE | MAE | MASE |
sARIMA | 1,188 | 1,085 | 0,198 |
Multiplicative HW | 0,813 | 0,727 | 0,133 |
Additive HW | 0,888 | 0,807 | 0,148 |
ANN based on sARIMA | 1,145 | 1,048 | 0,192 |
ANFIS based on sARIMA | 1,176 | 1,076 | 0,197 |
ANN based on Multiplicative HW | 0,543 | 0,460 | 0,084 |
ANFIS based on Multiplicative HW | 0,902 | 0,814 | 0,149 |
ANN based on Additive HW | 0,629 | 0,493 | 0,090 |
ANFIS based on Additive HW | 1,130 | 0,977 | 0,179 |
Naïve | 1,605 | 1,552 | 0,284 |
Note: Naïve model is random walk model and is used as benchmark model. Entries represent the forecasting accuracy metrics values (divided by 1000 for RMSE and MAE). |
where Ai presents the actual value, Fi presents the forecast value and m represents the seasonal period.
The results from Table 6 indicate that the hybrid models by using the ANN model are superior to simple models, such as the sARIMA model and both multiplicative and additive HW models. However, the hybrid models by using ANFIS model falls behind the simple models in terms of forecasting performance. This finding supports Makridakis et al. [1]'s finding, as they mentioned that simple model is better than complex model. However, if the hybrid model including the ANN model is used for forecasting the total electricity production, it is obvious that the forecast accuracy can be improved. In addition, Diebold- Mariano (DM) statistical test proposed by Diebold and Mariano [55] was employed to show whether there is any statistical difference between the models in terms of forecasting performances.
According to Table 6 and 7, the best forecasting performance is achieved by using the ANN based on multiplicative HW hybrid model. Moreover, the ANN based additive HW model can be chosen for forecasting the total electricity production in South Africa. The results confirm that the hybrid models are better than the univariate time series models for the total electricity production forecasting in South Africa. Figure 10 shows the plot of forecasts by the simple models and the hybrid models.
Model 2/Model 1 | ANN based on Multiplicative HW |
sARIMA | −5.290 (0.000)* |
ANN based on sARIMA | −5.465 (0.000)* |
ANFIS based sARIMA | −5.247 (0.000)* |
HW Multiplicative | −4.555 (0.000)* |
HW Additive | −4.089 (0.000)* |
ANN based on Additive HW | −0.251 (0.403) |
ANFIS based on Multiplicative HW | −5.446 (0.000)* |
ANFIS based on Additive HW | −2.655 (0.011)** |
Naïve | −6.364 (0.000)* |
Note: The null hypothesis is that Model 1 is less accurate than Model 2. Model 1 represents. ANN based on Multiplicative HW. * and ** indicate that the null hypothesis is rejected by 1% and 5% significance level respectively. |
In addition to Figure 10, Figure 11 also shows that the success of the hybrid models can be attributed to its ability to more closely track the actual values than simple models. The hybrid models show better forecasting performance in almost all forecast point. Figure 12 also confirms that the hybrid models produce more accurate forecasts than the simple models in terms of the accuracy metrics.
The hybrid models are capable of improving the forecasting performances where simple models fall short in terms of forecasting accuracy. Ten different forecasting models, the sARIMA model, the multiplicative HW model, the additive HW model, the ANN based on sARIMA model, the ANN based on multiplicative HW model, the ANN based on additive HW model, the ANFIS based on sARIMA model, the ANFIS based on multiplicative HW model, the ANFIS based on additive HW model, and the naïve model as benchmark, were applied to forecast the total electricity production data. For this study, three different models are considered for forecasting with simple models, the ANN based linear models and the ANFIS based linear models. The forecasting results show that the best linear model is the multiplicative HW model and the best hybrid model is the ANN based on multiplicative HW model. This result is compatible with the expectation which may sign that the selection of the best linear model from the other linear models in terms of out-of-sample forecasting performance is significantly important when combining the models. The ANN based on multiplicative HW model, which shows the higher forecasting accuracy has superior capability in modeling the total electricity production over the linear models in terms of all the evaluation criteria. Thus, the hybrid model not only provides better forecasting performance, but also shows better statistical interpretation. However, this accuracy is achieved at the expense of computational complexity. Hence, it is recommended to use ANN based linear models only in those models in which the hybrid models are useful in extracting advantages of individual models for forecasting the total electricity production with higher forecasting accuracy. Especially, since the ANN based both multiplicative and additive HW models have the best forecasting performance, an interesting direction of further research is to investigate whether both multiplicative and additive HW models can be efficiently combined with the other nonlinear models. Additionally, the success of the hybrid models depends on how well the linear and nonlinear components are forecasted. This finding points out the usability of different linear and nonlinear forecasting models which give new incentives for further researches.
The author declares no conflict of interest in this paper.
[1] |
A. Corcoran, R, F. Del Maestro, Testing the "go or grow" hypothesis in human medulloblastoma cell lines in two and three dimensions, Neurosurgery, 53 (2003), 174-185. doi: 10.1227/01.NEU.0000072442.26349.14
![]() |
[2] |
A. Giese, M. A. Loo, N. Tran, D. Haskett, S. W. Coons, M. E. Berens, Dichotomy of astrocytoma migration and proliferation, Int. J. Cancer, 67 (1996), 275-282. doi: 10.1002/(SICI)1097-0215(19960717)67:2<275::AID-IJC20>3.0.CO;2-9
![]() |
[3] |
A. Giese, R. Bjerkvig, M. E. Berens, M. Westphal, Cost of migration: invasion of malignant gliomas and implications for treatment, J. Clin. Oncol., 21 (2003), 1624-1636. doi: 10.1200/JCO.2003.05.063
![]() |
[4] |
K. S. Hoek, O. M. Eichhoff, N. C. Schlegel, U. Dobbeling, N. Kobert, L. Schaerer, et al., In vivo switching of human melanoma cells between proliferative and invasive states, Cancer Res., 68 (2008), 650-656. doi: 10.1158/0008-5472.CAN-07-2491
![]() |
[5] |
J. Godlewski, M. O. Nowicki, A. Bronisz, G. Nuovo, J. Palatini, M. D. Lay, et al., MicroRNA-451 regulates LKB1/AMPK signaling and allows adaptation to metabolic stress in glioma cells, Mol. Cell, 37 (2010), 620-632. doi: 10.1016/j.molcel.2010.02.018
![]() |
[6] |
L. Jerby, L. Wolf, C. Denkert, G. Y. Stein, M. Hilvo, M. Oresic, et al., Metabolic associations of reduced proliferation and oxidative stress in advanced breast cancer, Cancer Res., 72 (2012), 5712-5720. doi: 10.1158/0008-5472.CAN-12-2215
![]() |
[7] | J. Metzcar, Y. Wang, R. Heiland, P. Macklin, A review of cell-based computational modeling in cancer biology, JCO Clin. Cancer Inf., (2019), 1-13. |
[8] | J. A. Gallaher, J. S. Brown, A. R. A. Anderson, The impact of proliferation-migration tradeoffs on phenotypic evolution in cancer, Sci. Rep., 9 (2019), 1-10. |
[9] |
M. Tektonidis, H. Hatzikirou, A. Chauvière, M. Simon, K. Schaller, A. Deutsch, Identification of intrinsic in vitro cellular mechanisms for glioma invasion, J. Theor. Biol., 287 (2011), 131-147. doi: 10.1016/j.jtbi.2011.07.012
![]() |
[10] |
T. Garay, É. Juhász, E. Molnár, M. Eisenbauer, A. Czirók, B. Dekan, et al., Cell migration or cytokinesis and proliferation? - Revisiting the "go or grow" hypothesis in cancer cells in vitro, Exp. Cell Res., 319 (2013), 3094-3103. doi: 10.1016/j.yexcr.2013.08.018
![]() |
[11] |
S. T. Vittadello, S. W. McCue, G. Gunasingh, N. K. Haass, M. J. Simpson, Examining go-or-grow using fluorescent cell-cycle indicators and cell-cycle-inhibiting drugs, Biophys. J., 118 (2020), 1243-1247. doi: 10.1016/j.bpj.2020.01.036
![]() |
[12] |
J. C. L. Alfonso, K. Talkenberger, M. Seifert, B. Klink, A. Hawkins-Daarud, K. R. Swanson, et al., The biology and mathematical modelling of glioma invasion: a review, J. R. Soc. Interface, 14 (2017), 20170490. doi: 10.1098/rsif.2017.0490
![]() |
[13] | K. Böttger, H. Hatzikirou, A. Chauviere, A. Deutsch, Investigation of the migration/proliferation dichotomy and its impact on avascular glioma invasion, Math. Model. Nat. Phenom., 7 (2012), 105-135. |
[14] |
K. Böttger, H. Hatzikirou, A. Voss-Böhme, E. A. Cavalcanti-Adam, M. A. Herrero, A. Deutsch, An emerging Allee effect is critical for tumor initiation and persistence, PLOS Comput. Biol., 11 (2015), e1004366. doi: 10.1371/journal.pcbi.1004366
![]() |
[15] | P. Gerlee, S. Nelander, The impact of phenotypic switching on glioblastoma growth and invasion, PLoS Comput. Biol., 8 (2012). |
[16] |
H. Hatzikirou, D. Basanta, M. Simon, K. Schaller, A. Deutsch, 'Go or Grow': the key to the emergence of invasion in tumour progression?, Math. Med. Biol., 29 (2012), 49-65. doi: 10.1093/imammb/dqq011
![]() |
[17] | H. N. Weerasinghe, P. M. Burrage, K. Burrage, D. V. Nicolau, Mathematical models of cancer cell plasticity, J. Oncol., (2019), 1-C. |
[18] | Y. Kim, H. Kang, S. Lawler, The role of the miR-451-AMPK signaling pathway in regulation of cell migration and proliferation in glioblastoma, in Mathematical Models of Tumor-Immune System Dynamics, Springer, New York, (2014), 25-155. |
[19] |
A. V. Kolobov, V. V. Gubernov, A. A. Polezhaev, Autowaves in the model of infiltrative tumour growth with migration-proliferation dichotomy, Math. Model. Nat. Phenom., 6 (2011), 27-38. doi: 10.1051/mmnp/20116703
![]() |
[20] |
K. Pham, A. Chauviere, H. Hatzikirou, X. Li, H. M. Byrne, V. Cristini, et al., Density-dependent quiescence in glioma invasion: instability in a simple reaction-diffusion model for the migration/proliferation dichotomy, J. Biol. Dyn., 6 (2012), 54-71. doi: 10.1080/17513758.2011.590610
![]() |
[21] |
O. Saut, J. B. Lagaert, T. Colin, H. M. Fathallah-Shaykh, A multilayer grow-or-go model for GBM: effects of invasive cells and anti-angiogenesis on growth, Bull. Math. Biol., 76 (2014), 2306-2333. doi: 10.1007/s11538-014-0007-y
![]() |
[22] |
T. L. Stepien, E. M. Rutter, Y. Kuang, Traveling waves of a go-or-grow model of glioma growth, SIAM J. Appl. Math., 78 (2018), 1778-1801. doi: 10.1137/17M1146257
![]() |
[23] |
C. Stinner, C. Surulescu, A. Uatay, Global existence for a go-or-grow multiscale model for tumor invasion with therapy, Math. Model. Methods Appl. Sci., 26 (2016), 2163-2201. doi: 10.1142/S021820251640011X
![]() |
[24] |
E. Scribner, O. Saut, P. Province, A. Bag, T. Colin, H. M. Fathallah-Shaykh, Effects of anti-angiogenesis on glioblastoma growth and migration: model to clinical predictions, PLoS One, 9 (2014), e115018. doi: 10.1371/journal.pone.0115018
![]() |
[25] | A. Zhigun, C. Surulescu, A. Hunt, A strongly degenerate diffusion-haptotaxis model of tumour invasion under the go-or-grow dichotomy hypothesis, Math. Methods Appl. Sci., 41 (2018), 2403-2428. |
[26] | M. A. Lewis, G. Schmitz, Biological invasion of an organism with separate mobile and stationary states: Modeling and analysis, Forma, 11 (1996), 1-25. |
[27] | X. Ma, M. E. Schickel, M. D. Stevenson, A. L. Sarang-Sieminski, K. J. Gooch, S. N. Ghadiali, et al., Fibres in the extracellular matrix enable long-range stress transmission between cells, Biophys. J., 104 (2013), 1410-1418. |
[28] |
N. J. Armstrong, K. J. Painter, J. A. Sherratt, A continuum approach to modelling cell-cell adhesion, J. Theor. Biol., 243 (2006), 98-113. doi: 10.1016/j.jtbi.2006.05.030
![]() |
[29] |
T. Alzahrani, R. Eftimie, D. Trucu, Multiscale modelling of cancer response to oncolytic viral therapy, Math. Biosci., 310 (2019), 76-95. doi: 10.1016/j.mbs.2018.12.018
![]() |
[30] | J. Ahn, M. Chae, J. Lee, Nonlocal adhesion models for two cancer cell phenotypes in a multidimensional bounded domain, Z. Angew. Math. Phys., 72 (2021), 1-28. |
[31] | M. Eckardt, K. J. Painter, C. Surulescu, A. Zhigun, Nonlocal and local models for taxis in cell migration: a rigorous limit procedure, J. Math. Biol., 81 (2020), 1251-1298. |
[32] |
R. Alemany, Viruses in cancer treatment, Clin. Transl. Oncol., 15 (2013), 182-188. doi: 10.1007/s12094-012-0951-7
![]() |
[33] |
M. Vähä-Koskela, A. Hinkkanen, Tumor restrictions to oncolytic virus, Biomedicines, 2 (2014), 163-194. doi: 10.3390/biomedicines2020163
![]() |
[34] | A. Alsisi, R. Eftimie, D. Trucu, Non-local multiscale approaches for tumour-oncolytic viruses interactions, Math. Appl. Sci. Eng., 99 (2020), 1-27. |
[35] |
D. Trucu, P. Lin, M. A. J. Chaplain, Y. Wang, A multiscale moving boundary model Arising in cancer invasion, Multiscale Model. Simul., 11 (2013), 309-335. doi: 10.1137/110839011
![]() |
[36] |
N. Bhagavathula, A. W. Hanosh, K. C. Nerusu, H. Appelman, S. Chakrabarty, J. Varani, Regulation of E-cadherin and β-catenin by Ca2+ in colon carcinoma is dependent on calcium-sensing receptor expression and function, Int. J. Cancer, 121 (2007), 1455-1462. doi: 10.1002/ijc.22858
![]() |
[37] |
U. Cavallaro, G. Christofori, Cell adhesion in tumor invasion and metastasis: loss of the glue is not enough, Biochim. Biophys. Acta Rev. Cancer, 1552 (2001), 39-45. doi: 10.1016/S0304-419X(01)00038-5
![]() |
[38] |
J. D. Humphries, Integrin ligands at a glance, J. Cell Sci., 119 (2006), 3901-3903. doi: 10.1242/jcs.03098
![]() |
[39] |
K. S. Ko, P. D. Arora, V. Bhide, A. Chen, C. A. McCulloch, Cell-cell adhesion in human fibroblasts requires calcium signaling, J. Cell Sci., 114 (2001), 1155-1167. doi: 10.1242/jcs.114.6.1155
![]() |
[40] | B. P. L. Wijnhoven, W. N. M. Dinjens, M. Pignatelli, E-cadherin-catenin cell-cell adhesion complex and human cancer, Br. J. Surg., 87 (2000), 992-1005. |
[41] |
M. A. J. Chaplain, G. Lolas, Mathematical modelling of cancer invasion of tissue: dynamic heterogeneity, Networks Heterog. Media, 1 (2006), 399-439. doi: 10.3934/nhm.2006.1.399
![]() |
[42] | M. J. Oudin, O. Jonas, T. Kosciuk, L. C. Broye, B. C. Guido, J. Wyckoff, et al., Tumor cell-driven extracellular matrix remodeling drives haptotaxis during metastatic progression, Cancer Discovery, 6 (2016), 516-531. |
[43] |
P. Domschke, D. Trucu, A. Gerisch, M. A. J. Chaplain, Mathematical modelling of cancer invasion: Implications of cell adhesion variability for tumour infiltrative growth patterns, J. Theor. Biol., 361 (2014), 41-60. doi: 10.1016/j.jtbi.2014.07.010
![]() |
[44] |
A. Gerisch, M. A. J. Chaplain, Mathematical modelling of cancer cell invasion of tissue: Local and non-local models and the effect of adhesion, J. Theor. Biol., 250 (2008), 684-704. doi: 10.1016/j.jtbi.2007.10.026
![]() |
[45] |
Z. Gu, F. Liu, E. A. Tonkova, S. Y. Lee, D. J. Tschumperlin, M. B. Brenner, Soft matrix is a natural stimulator for cellular invasiveness, Mol. Biol. Cell, 25 (2014), 457-469. doi: 10.1091/mbc.e13-05-0260
![]() |
[46] |
A. M. Hofer, S. Curci, M. A. Doble, E. M. Brown, D. I. Soybel, Intercellular communication mediated by the extracellular calcium-sensing receptor, Nat. Cell Biol., 2 (2000), 392-398. doi: 10.1038/35017020
![]() |
[47] |
R. Shuttleworth, D. Trucu, Multiscale modelling of fibres dynamics and cell adhesion within moving boundary cancer invasion, Bull. Math. Biol., 81 (2019), 2176-2219. doi: 10.1007/s11538-019-00598-w
![]() |
[48] | C. Guiot, P. Degiorgis, P. Delsanto, P. Gabriele, T. Diesboeck, Does tumour growth follow a "universal law"?, J. Theor. Biol., 225 (2003), 147-151. |
[49] | A. Laird, Dynamics of tumour growth, Br. J. Cancer, 18 (1964), 490-502. |
[50] |
D. Hanahan, R. A. Weinberg, Hallmarks of cancer: The next generation, Cell, 144 (2011), 646-674. doi: 10.1016/j.cell.2011.02.013
![]() |
[51] | R. A. Weinberg, The Biology of Cancer, Garland Science, New York, 2006. |
[52] | P. R. Halmos, Measure Theory, Springer-Verlag, New York, USA, 1974. |
[53] | S. Suveges, R. Eftimie, D. Trucu, Directionality of macrophages movement in tumour invasion: A multiscale moving-boundary approach, Bull. Math. Biol., 82 (2020), 1-48. |
[54] |
B. I. Camara, H. Mokrani, E. Afenya, Mathematical modeling of glioma therapy using oncolytic viruses, Math. Biosci. Eng., 10 (2013), 565-578. doi: 10.3934/mbe.2013.10.565
![]() |
[55] |
K. J. Painter, N. J. Armstrong, J. A. Sherratt, The impact of adhesion on cellular invasion processes in cancer and development, J. Theor. Biol., 264 (2010), 1057-1067. doi: 10.1016/j.jtbi.2010.03.033
![]() |
[56] |
R. Shuttleworth, D. Trucu, Multiscale dynamics of a heterotypic cancer cell population within a fibrous extracellular matrix, J. Theor. Biol., 486 (2020), 110040. doi: 10.1016/j.jtbi.2019.110040
![]() |
[57] |
L. Peng, D. Trucu, P. Lin, A. Thompson, M. A. J. Chaplain, A multiscale mathematical model of tumour invasive growth, Bull. Math. Biol., 79 (2017), 389-429. doi: 10.1007/s11538-016-0237-2
![]() |
[58] |
T. Alzahrani, R. Eftimie, D. Trucu, Multiscale moving boundary modelling of cancer interactions with a fusogenic oncolytic virus: The impact of syncytia dynamics, Math. Biosci., 323 (2020), 108296. doi: 10.1016/j.mbs.2019.108296
![]() |
1. | SITI NOR ZULAIKA AMRAN, NORIZAN MOHAMED, FORECASTING ELECTRICITY SUPPLIED IN TURKEY USING HOLT-WINTERS’ MULTIPLICATIVE METHOD AND ARTIFICIAL NEURAL NETWORK (ANN) MODELS, 2021, 3, 2637-1138, 131, 10.46754/umtjur.v3i3.225 | |
2. | Weibiao Qiao, Zhaoyang Li, Wei Liu, Enbin Liu, Fastest‐growing source prediction of US electricity production based on a novel hybrid model using wavelet transform, 2022, 46, 0363-907X, 1766, 10.1002/er.7293 | |
3. | Benjamin P. Fram, Ritvana Rrukaj, Leif K. Sandal, 2022, Risk Management in Wholesale Electricity Markets: A Signal Processing Approach, 978-1-6654-0971-1, 1, 10.1109/REST54687.2022.10022333 | |
4. | Esso-wazam Honoré Tchandao, Adekunlé Akim Salami, Koffi Mawugno Kodjo, 2022, Modeling the Probability Distribution of Electrical Power Importations Using Kernel Density Estimation Model, 978-1-6654-6119-1, 1, 10.1109/HiTech56937.2022.10145546 | |
5. | Shavkatjon Tulkinov, Grey forecast of electricity production from coal and renewable sources in the USA, Japan and China, 2023, 13, 2043-9377, 517, 10.1108/GS-10-2022-0107 |
Mean | Maximum | Minimum | Std. Dev. | Skewness | Kurtosis |
17800.330 | 22868.000 | 11073.000 | 3442.301 | −0.287 | 1.676 |
LB-Q (12) | p-value | J-B Test | p-value | ARCH (12) | p-value |
1207.133 | 0.000 | 11.36815 | 0.0034 | 3.687 | 0.978 |
Hybrid Methodology |
1. Given a time series x = [x1, x2, …, xn]. 2. Determine the training set (82%) and validation test (9%) size as in-sample, and test size (9%) as out-of-sample. 3. Normalize the time series data using min-max normalization using the following formula: zi = (xi-min(x))/(max(x)-min(x)) 4. Determine the best simple individual models and their parameters using the normalized vector in-sample period. 5. Obtain the forecasts using selected simple individual models. 6. De-normalize the forecasts to find ^Lt in Equation (7). 7. Obtain residuals (εt) by subtracting simple models' forecasts from actual values in Equation (7). 8. Select the best lag number to determine the number of inputs on the data of residual series. 9. Normalize the residual series using min-max normalization. 10. Obtain forecasts using ANN and ANFIS models. 11. De-normalize the forecasts to obtain ^Nt from Equation (8). 12. Combine the simple models' forecasts with ANN and ANFIS models' forecasts as in Equation (9). |
input: εi,j:the errors obtained by subtracting jth model predictions from actual values, ζm: model building period (training period), ζv: validation period, ζt: test period, i: the number of observed predictions, j: the number of univariate models, h1(thenumberofnodesinfirsthiddenlayer)=c(1,2,…,12), h2(thenumberofnodesinsecondhiddenlayer)=c(1,2,…,12), k: the time delay number of εi,j. |
1: Given εi,j, ζm, ζv 2: foreach j do foreach k do create datamatrix =cbind(Lag((εi,j,k=1,..4), εi,j) end for end for 3: Run neural network model {εi,j;ζm,h1,h2,ε′i,j;ζm} 4: Compute MAEζv,ε′i,j,h1,h2 5: Save minimum MAEζv,ε′i,j,h1,h2 to determine the optimum number of nodes in each hidden layer 6: Run neural network model {εi,j;ζm+ζv,h1,h2,ε′i,j;ζm} output: Optimal model − optimal ε′i,j with optimum h1 and h2 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 4 | 262.138 |
2 | 2 | 9 | 261.9352 |
3 | 5 | 2 | 233.5972 |
4 | 6 | 4 | 243.0433 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 242.2049 |
2 | 4 | 8 | 245.0679 |
3 | 8 | 9 | 230.3607 |
4 | 1 | 7 | 233.0093 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
Models | RMSE | MAE | MASE |
sARIMA | 1,188 | 1,085 | 0,198 |
Multiplicative HW | 0,813 | 0,727 | 0,133 |
Additive HW | 0,888 | 0,807 | 0,148 |
ANN based on sARIMA | 1,145 | 1,048 | 0,192 |
ANFIS based on sARIMA | 1,176 | 1,076 | 0,197 |
ANN based on Multiplicative HW | 0,543 | 0,460 | 0,084 |
ANFIS based on Multiplicative HW | 0,902 | 0,814 | 0,149 |
ANN based on Additive HW | 0,629 | 0,493 | 0,090 |
ANFIS based on Additive HW | 1,130 | 0,977 | 0,179 |
Naïve | 1,605 | 1,552 | 0,284 |
Note: Naïve model is random walk model and is used as benchmark model. Entries represent the forecasting accuracy metrics values (divided by 1000 for RMSE and MAE). |
Model 2/Model 1 | ANN based on Multiplicative HW |
sARIMA | −5.290 (0.000)* |
ANN based on sARIMA | −5.465 (0.000)* |
ANFIS based sARIMA | −5.247 (0.000)* |
HW Multiplicative | −4.555 (0.000)* |
HW Additive | −4.089 (0.000)* |
ANN based on Additive HW | −0.251 (0.403) |
ANFIS based on Multiplicative HW | −5.446 (0.000)* |
ANFIS based on Additive HW | −2.655 (0.011)** |
Naïve | −6.364 (0.000)* |
Note: The null hypothesis is that Model 1 is less accurate than Model 2. Model 1 represents. ANN based on Multiplicative HW. * and ** indicate that the null hypothesis is rejected by 1% and 5% significance level respectively. |
Mean | Maximum | Minimum | Std. Dev. | Skewness | Kurtosis |
17800.330 | 22868.000 | 11073.000 | 3442.301 | −0.287 | 1.676 |
LB-Q (12) | p-value | J-B Test | p-value | ARCH (12) | p-value |
1207.133 | 0.000 | 11.36815 | 0.0034 | 3.687 | 0.978 |
Hybrid Methodology |
1. Given a time series x = [x1, x2, …, xn]. 2. Determine the training set (82%) and validation test (9%) size as in-sample, and test size (9%) as out-of-sample. 3. Normalize the time series data using min-max normalization using the following formula: zi = (xi-min(x))/(max(x)-min(x)) 4. Determine the best simple individual models and their parameters using the normalized vector in-sample period. 5. Obtain the forecasts using selected simple individual models. 6. De-normalize the forecasts to find ^Lt in Equation (7). 7. Obtain residuals (εt) by subtracting simple models' forecasts from actual values in Equation (7). 8. Select the best lag number to determine the number of inputs on the data of residual series. 9. Normalize the residual series using min-max normalization. 10. Obtain forecasts using ANN and ANFIS models. 11. De-normalize the forecasts to obtain ^Nt from Equation (8). 12. Combine the simple models' forecasts with ANN and ANFIS models' forecasts as in Equation (9). |
input: εi,j:the errors obtained by subtracting jth model predictions from actual values, ζm: model building period (training period), ζv: validation period, ζt: test period, i: the number of observed predictions, j: the number of univariate models, h1(thenumberofnodesinfirsthiddenlayer)=c(1,2,…,12), h2(thenumberofnodesinsecondhiddenlayer)=c(1,2,…,12), k: the time delay number of εi,j. |
1: Given εi,j, ζm, ζv 2: foreach j do foreach k do create datamatrix =cbind(Lag((εi,j,k=1,..4), εi,j) end for end for 3: Run neural network model {εi,j;ζm,h1,h2,ε′i,j;ζm} 4: Compute MAEζv,ε′i,j,h1,h2 5: Save minimum MAEζv,ε′i,j,h1,h2 to determine the optimum number of nodes in each hidden layer 6: Run neural network model {εi,j;ζm+ζv,h1,h2,ε′i,j;ζm} output: Optimal model − optimal ε′i,j with optimum h1 and h2 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 4 | 262.138 |
2 | 2 | 9 | 261.9352 |
3 | 5 | 2 | 233.5972 |
4 | 6 | 4 | 243.0433 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 242.2049 |
2 | 4 | 8 | 245.0679 |
3 | 8 | 9 | 230.3607 |
4 | 1 | 7 | 233.0093 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
Validation Set Performance | |||
Lag | Hidden Layer 1 | Hidden Layer 2 | MAE |
1 | 9 | 6 | 217.1001 |
2 | 2 | 9 | 218.1105 |
3 | 8 | 3 | 200.2002 |
4 | 1 | 7 | 205.1442 |
Models | RMSE | MAE | MASE |
sARIMA | 1,188 | 1,085 | 0,198 |
Multiplicative HW | 0,813 | 0,727 | 0,133 |
Additive HW | 0,888 | 0,807 | 0,148 |
ANN based on sARIMA | 1,145 | 1,048 | 0,192 |
ANFIS based on sARIMA | 1,176 | 1,076 | 0,197 |
ANN based on Multiplicative HW | 0,543 | 0,460 | 0,084 |
ANFIS based on Multiplicative HW | 0,902 | 0,814 | 0,149 |
ANN based on Additive HW | 0,629 | 0,493 | 0,090 |
ANFIS based on Additive HW | 1,130 | 0,977 | 0,179 |
Naïve | 1,605 | 1,552 | 0,284 |
Note: Naïve model is random walk model and is used as benchmark model. Entries represent the forecasting accuracy metrics values (divided by 1000 for RMSE and MAE). |
Model 2/Model 1 | ANN based on Multiplicative HW |
sARIMA | −5.290 (0.000)* |
ANN based on sARIMA | −5.465 (0.000)* |
ANFIS based sARIMA | −5.247 (0.000)* |
HW Multiplicative | −4.555 (0.000)* |
HW Additive | −4.089 (0.000)* |
ANN based on Additive HW | −0.251 (0.403) |
ANFIS based on Multiplicative HW | −5.446 (0.000)* |
ANFIS based on Additive HW | −2.655 (0.011)** |
Naïve | −6.364 (0.000)* |
Note: The null hypothesis is that Model 1 is less accurate than Model 2. Model 1 represents. ANN based on Multiplicative HW. * and ** indicate that the null hypothesis is rejected by 1% and 5% significance level respectively. |