
Energy operations and schedules are significantly impacted by load and energy forecasting systems. An effective system is a requirement for a sustainable and equitable environment. Additionally, a trustworthy forecasting management system enhances the resilience of power systems by reducing power and load-forecast errors. However, due to the numerous inherent nonlinear properties of huge and diverse data, classical statistical methodologies cannot appropriately learn this nonlinearity in the data. Advanced techniques allow energy systems to evaluate data appropriately and regulate energy consumption. Compared with machine learning, deep learning techniques have lately been used to predict energy consumption as well as to learn long-term dependencies. In this work, a fusion of a novel multi-directional gated recurrent unit (MD-GRU) with a convolutional neural network (CNN) using global average pooling (GAP) as hybridization is proposed for load and energy forecasting. The spatial and temporal aspects, along with the high dimensionality of the data, are addressed by employing the capabilities of the MD-GRU and CNN integration. The obtained results are compared with baseline algorithms including CNN, Long Short-Term Memory (LSTM), Bidirectional Long Short-Term Memory (Bi-LSTM), Gated Recurrent Unit (GRU), and Bidirectional Gated Recurrent Unit (Bi-GRU). The experimental findings indicate that the proposed approach surpasses conventional approaches in terms of accuracy, Mean Absolute Percentage Error (MAPE), and Root Mean Square Error (RMSE).
Citation: Fazeel Abid, Muhammad Alam, Faten S. Alamri, Imran Siddique. Multi-directional gated recurrent unit and convolutional neural network for load and energy forecasting: A novel hybridization[J]. AIMS Mathematics, 2023, 8(9): 19993-20017. doi: 10.3934/math.20231019
Energy-management functions and timing are greatly influenced by load and power prediction. An effective load forecasting and energy management system is essential to a fair and sustainable system. Many variables, including cost, load flow and power plant demand, depend on data collected by the load and power forecasting system [1,2]. Large energy storage and demand fluctuations also require proactive balancing of load changes and energy management systems, as shown in [3,4]. A reliable load and power forecasting management system helps reduce power and load forecast errors, enhances the robustness of power systems and mitigates changes in supply and demand [5,6].
Conventional statistical approaches, including the linear regression (LR) approach and the autoregressive moving average (ARMA) methodology, have previously been used in load and power prediction methods based on massive amounts of data [7,8]. However, traditional statistical techniques cannot adequately learn such nonlinear data or meet load-prediction accuracy criteria due to the many intrinsic nonlinear properties of big data [9,10]. Modern artificial intelligence technology enables energy systems to analyze data and manage demand properly. Machine learning methods have recently been used to predict power consumption and load patterns. These approaches lead to better control of load and power, resulting in lower energy use, longer resource life and lower processing costs [11,12,13].
Neural networks are currently being used more and more to tackle complex time-series prediction problems, as a result of their recent success in challenges associated with computer vision and image processing. By comparing actual usage with forecasts from the forecasting system, accuracy can be improved and money and energy saved. Inaccurate load forecasting can damage the scheduling and planning of power systems, lead to power shortages in the market and, eventually, increase the desire to extend existing Artificial Neural Network (ANN) designs [14,15]. The extension is produced by adding more hidden layers, which can handle nonlinearities in the input and output data. The goal is to provide a Deep Neural Network (DNN) based power load prediction model for power usage prediction.
Initially, many neural networks used backpropagation techniques for short-term load forecasting, as proposed in [16,17,18]. Similarly, sequence data are processed in many works using recurrent neural networks, as proposed in [19,20,21]. Based on climatological data, local and global RNN models were developed to address the issue of long-term wind speed and energy forecasting [22,23]. However, during error backpropagation, the vanishing gradient problem still arises due to the enormous depth in time of the traditional Recurrent Neural Network (RNN) architecture, which cannot learn long-term historical data. Comparatively, long short-term memory (LSTM), an improved form of RNN, was introduced in [24]; it uses gated mechanisms to integrate short-term and long-term memory.
More recently, the Gated Recurrent Unit (GRU), based on an optimized LSTM, has combined the input gate and forget gate of the LSTM into a single update gate [25,26]. Still, the GRU model is not enough, because the power system additionally contains high-dimensional data such as image data, spatiotemporal matrices and sequence data, and these optimized models are not sufficient to handle high-dimensional data effectively. By contrast, a CNN is suitable for processing large amounts of high-dimensional data and has been successfully applied in many research works [27,28]. The CNN can capture local directional characteristics and scale-invariant features when there is a significant correlation between adjacent data points [29,30].
Accordingly, there are four categories of power load projection: long-term, medium-term, short-term and very short-term [31,32,33,34]. The main focus should be on very short-term electrical load and power forecasting, which predicts future loads and can assist power system operators in setting up realistic production plans, keeping supply and demand in balance, and assuring safety while minimizing resource waste and electricity prices. The conventional statistical methodology has proven ineffective for learning nonlinearity due to the multiple intrinsic nonlinear features of large and diverse data sets. Therefore, to predict energy consumption, machine and deep learning approaches have recently been applied to solve issues associated with high dimensionality and learning long-term dependencies. A novel load and energy demand predictor using a multi-directional GRU (MD-GRU) and a CNN with different pooling schemes is proposed in this work. To improve the extraction and learning of features in the chronological data, the MD-GRU is used to model the dynamic changes in the historical energy sequence data. The CNN module processes spatiotemporal matrices and maps them into distinct vectors by taking global values rather than maximum or average values. The model predicts the power-consumption load 10 minutes in advance, evaluated using several statistical techniques relating predicted and actual values. These predicted values can be used to make wiser judgments and enhance the overall effectiveness of the system. The following are the main contributions of this paper:
1- A novel multi-directional GRU (MD-GRU) model is presented for detecting dynamic changes in historical load and energy sequence data.
2- MD-GRU is integrated with a CNN that uses global average pooling (GAP) to estimate the load and energy demand and capture long-term dependencies.
3- The proposed MD-GRU-CNN model is compared with baseline models such as LSTM, GRU, Bi-LSTM, Bi-GRU, and CNN.
4- The proposed model is also assessed using metrics including the mean absolute percentage error (MAPE) and root mean square error (RMSE), in addition to conventional assessment measures.
The rest of the paper is structured as follows: Section 2 describes related work. Sections 3 and 4 contain the deep learning techniques for load and energy forecasting as well as the proposed methodology, while Section 5 contains the experimentation. Section 6 is for discussion, and Section 7 concludes the paper.
For the load and energy management system, there has been major growth in the use of modern metering and sensors, which has greatly increased the available information. A significant amount of data is produced, serving as a trustworthy source of information for load forecasting algorithms. These load forecasting models come in parametric and non-parametric versions. While ANNs are among the most widely used non-parametric techniques for creating new energy demand predictions, parametric methodologies are based on analytical models [35,36]. Numerous electrical load forecasting models using the latest deep learning techniques have been proposed in recent years; some of the more popular models that researchers now employ are detailed in [37,38,39].
To forecast energy consumption over various time horizons, time-series approaches have frequently been used. The most prevalent model is the autoregressive moving average (ARMA) model, which combines the moving average (MA) and autoregressive (AR) models. The AR model relies on the linear regression of actual values from a database, while the MA model uses the moving average to compute regressions over the forecast errors of previous values [40]. Another study uses autoregressive integrated moving average (ARIMA) models to create novel forecasting systems [41]. However, the enormous data contain numerous intrinsic nonlinear properties, and classic statistical approaches cannot effectively learn such nonlinear data, even though these methods may, to some extent, estimate short-term demand. It is therefore exceedingly difficult for conventional statistical approaches to forecast load with any degree of accuracy.
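For concreteness, a minimal ARIMA baseline of the kind described above can be fit in a few lines with statsmodels; the series below is synthetic and the order (2, 1, 2) is an illustrative choice, not a value taken from this paper:

```python
import numpy as np
import pandas as pd
from statsmodels.tsa.arima.model import ARIMA

# Hypothetical half-hourly load series; in practice this would be real meter data.
rng = pd.date_range("2021-01-01", periods=500, freq="30min")
load = pd.Series(100 + 10 * np.sin(np.arange(500) / 48) + np.random.randn(500), index=rng)

# ARIMA(p, d, q): AR regression on past values, differencing, MA on past errors.
model = ARIMA(load, order=(2, 1, 2)).fit()
forecast = model.forecast(steps=48)  # predict the next 24 hours
print(forecast.head())
```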
Various machine learning (ML) techniques, such as artificial neural networks, fuzzy systems [42] and support vector machines (SVM) [43], have been effectively implemented in prediction and classification problems [44,45,46]. The development of a system for load and energy forecasting using SVM and ML techniques is effective in addressing the difficulties of nonlinearity [47]. An adaptive neuro-fuzzy inference system (ANFIS) was used in another project to create a short-term load and energy forecasting model [48]. The backpropagation neural network (BPNN) has also been considered for load forecasting as deep learning has advanced [49]. A hybrid model of BPNN and a multi-label algorithm based on K-nearest neighbor (KNN) and K-means was presented; however, because of the nature of feedforward neural networks, it is unable to effectively learn time-sequence data for the power system [50].
Computer vision, natural language processing and speech recognition have all benefited greatly from recent advances in neural networks, especially deep neural networks. Deep neural network applications for short-term load forecasting are currently a novel and widely discussed topic. An analysis of artificial neural networks (ANNs) for predicting short-term load is given in [51]. Recurrent neural networks (RNN), convolutional neural networks (CNN), long short-term memory (LSTM) and their combinations are just a few of the building blocks that have made deep neural networks highly adaptable and successful [52].
A novel approach integrating LSTM and a genetic algorithm (GA) was proposed in [53], producing a minimal mean absolute percentage error. An innovative method for the planning and operation of energy generation, the Bi-GRU (gated recurrent unit) prediction system, was also created [54]. However, the bidirectional characteristic of the GRU is not enough to gather long-term sequential information from all areas of energy and load forecasting. By using positive and negative input, the bidirectional model trains the nodes to operate in forward and reverse modes. The ability to focus in both directions (forward and backward) is useful for extracting more information from a long information sequence, but it is insufficient for energy and load forecasting [55]. Multiple load and energy forecasters have been produced in which the forecaster's framework must become more intricate to achieve forecasting accuracy.
Recurrent neural networks make use of sequential information by connecting inputs to particular time steps in the sequence [56,57]. The number of time steps determines the maximum input sequence length in the final result. Figure 1 depicts the recurrent neural network's architecture.
The sequential data are provided to the recurrent neural network (RNN) as feature-vector inputs, and at each step the hidden state and output are computed as:
$hs_t = f(X \cdot I_t + W_m \cdot hs_{t-1} + b)$,  (1)
$O_t = \mathrm{softmax}(Y \cdot hs_t)$,  (2)
where $I_t$ is the input at time step $t$, $hs_t$ is the hidden state and $O_t$ is the output.
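To make the recurrence concrete, the following NumPy sketch implements Eqs (1) and (2) directly; the layer sizes and the choice of tanh for $f$ are assumptions made purely for illustration:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Assumed sizes: input dim 4, hidden dim 8, output dim 3.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 4))    # input-to-hidden weights
W_m = rng.normal(size=(8, 8))  # hidden-to-hidden weights
Y = rng.normal(size=(3, 8))    # hidden-to-output weights
b = np.zeros(8)

hs = np.zeros(8)                             # initial hidden state hs_0
for I_t in rng.normal(size=(5, 4)):          # a toy sequence of 5 inputs
    hs = np.tanh(X @ I_t + W_m @ hs + b)     # Eq (1), with f = tanh
    O_t = softmax(Y @ hs)                    # Eq (2)
print(O_t)
```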
The outputs at each corresponding time step can be observed in Figure 1; however, depending on the task, these per-step outputs may not be essential. For managing energy and load, our objective is to concentrate on the output. A recurrent neural network can accumulate sequential information in the same situation even when input with the appropriate time step and hidden state is not required. Researchers previously found that recurrent neural networks can be improved for learning long-term dependencies by introducing LSTM, Bi-LSTM, GRU, and Bi-GRU. However, in the case of load and energy management, bidirectionality alone was insufficient to achieve the desired outcomes [58].
Long-term dependency issues are addressed with the LSTM [59]. When a long short-term memory network is trained using backpropagation through time, it can address the vanishing gradient problem [60]. Although LSTMs are considered an adequate methodology, the architecture only considers the past, which can sometimes result in poor performance, particularly in load and energy management, where future data are also crucial. With two hidden states, forward $\overrightarrow{hs}_t$ and backward $\overleftarrow{hs}_t$, bidirectionality in LSTMs can capture both contexts: prior and upcoming.
The gated recurrent unit (GRU), an optimization of the LSTM, performs better than conventional LSTMs. Compared to LSTMs, the GRU has fewer parameters, since it consolidates the input and forget gates described in [61] into only two gates. The bidirectional gated recurrent unit is similar to the Bi-LSTM but faster. A Bi-GRU can store sequences of information in both directions, prior and upcoming, and has a long short-term memory for preserving long-term dependencies. Many models have been successful in terms of informational flow in both directions for load and energy management [62]. For example, the hidden state generated by a bidirectional recurrent module that processes sequences in opposite directions is:
$[\overrightarrow{hs}_1 \| \overleftarrow{hs}_1, \ \overrightarrow{hs}_2 \| \overleftarrow{hs}_2, \ \ldots, \ \overrightarrow{hs}_n \| \overleftarrow{hs}_n]$.  (3)
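For intuition, a bidirectional GRU in PyTorch yields exactly this per-step concatenation of forward and backward hidden states; the sizes below are illustrative only:

```python
import torch
import torch.nn as nn

# Illustrative sizes: feature dim 4, hidden dim 8, sequence length 6.
gru = nn.GRU(input_size=4, hidden_size=8, bidirectional=True, batch_first=True)
x = torch.randn(1, 6, 4)   # (batch, time, features)

out, _ = gru(x)            # out: (1, 6, 16)
# Each out[:, t, :] is the concatenation in Eq (3):
# forward hidden state (first 8 dims) || backward hidden state (last 8 dims).
print(out.shape)           # torch.Size([1, 6, 16])
```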
Convolutional neural networks were initially designed for image classification but shortly afterwards evolved into models used for a variety of applications. The sequential data are fed into a convolutional layer, which generates various filters, learns several features, and applies these features successively to the various input divisions. The output is typically then pooled to lower dimensions and routed into a fully connected layer [15]. Multiple filters are used in this layer to extract local features. Figure 2 shows the traditional convolutional neural network architecture as described in [63].
To choose a maximal value, the max-pooling layer discards all other values that are not maximal. Dropout and layer normalization are essential to build dense connections in both the max-pooling and convolutional layers, as presented in [64].
A global average pooling layer processes the output of a convolutional layer and, acting as a regularizer, replaces the fully connected layer and dropout. Furthermore, it directly coordinates the mapping between feature maps and classes, which reduces the computational complexity. Additionally, no parameter adjustment is required in global average pooling [65]. The average of each feature map produces a single value, which is then entered directly into the softmax function to obtain the likelihood distribution, along with confidence values for each class. The following expression gives this likelihood:
$P(i_j \mid k, \theta) = \frac{\exp(x_j(k, \theta))}{\sum_{1 \leqslant i \leqslant |X|} \exp(x_i(k, \theta))}, \quad 1 \leqslant j \leqslant |X|$.  (4)
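A short PyTorch sketch of this GAP-then-softmax head; the channel and class counts are assumed for illustration:

```python
import torch
import torch.nn as nn

# Assumed shapes: 16 feature maps of size 10x10, one class per map.
feature_maps = torch.randn(1, 16, 10, 10)   # output of the last conv layer

gap = nn.AdaptiveAvgPool2d(1)                # global average pooling
pooled = gap(feature_maps).flatten(1)        # one average value per feature map -> (1, 16)
probs = torch.softmax(pooled, dim=1)         # Eq (4): likelihood per class
print(probs.sum())                           # ~1.0, a valid probability distribution
```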
For load and energy forecasting, deep learning techniques, particularly sequential models such as RNN and LSTM models, are being used. Recurrent neural networks are feedforward neural networks with internal memory: the network performs the same task for every data input, but the outcome of the current input depends on the computation over the prior inputs. The output is produced, duplicated, and then distributed throughout the network. However, the issue of vanishing and exploding gradients prevents the typical RNN design from supporting long sequences. Modified versions of recurrent neural networks, such as long short-term memory (LSTM) and gated recurrent units (GRUs), make it simpler to store past knowledge in memory [66].
Deep learning models are unable to interpret categorical values directly, so label-encoding approaches are required. These methods change the categorical values into numerical ones appropriate for deep neural network models. Understanding how the extended input sequence is created, however, is crucial. For energy and load management, the input data are time attributes of the date, such as a certain day, month, etc. It is therefore necessary to transform this kind of data into a sequence that can be used as additional input in the architecture for load and energy forecasting.
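A small illustration of this encoding step with scikit-learn; the timestamp frequency and feature names are hypothetical:

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical half-hourly readings with a categorical day-of-week attribute.
df = pd.DataFrame({"timestamp": pd.date_range("2021-01-01", periods=6, freq="30min")})
df["day_name"] = df["timestamp"].dt.day_name()   # e.g. 'Friday'
df["month"] = df["timestamp"].dt.month

# Map categorical day names to integers the network can consume.
df["day_code"] = LabelEncoder().fit_transform(df["day_name"])
print(df[["day_name", "day_code", "month"]].head())
```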
The fundamental principle of the MD-GRU is to replace the single recurrent connection present in regular RNNs with as many recurrent connections as there are dimensions in the data. During the forward pass, at each point in the data sequence, the hidden layer of the network receives an external input and its own activations from one step back along all dimensions. The module can individually scan all four directions across each column and row: hidden layers are calculated in the vertical and horizontal directions at every stage and summed at the end. Two output attribute maps are employed, one for the vertical direction and the other for the horizontal direction. The MD-GRU computes four sets of activations for each time step: the forward hidden states, backward hidden states, forward update gates, and backward update gates. These activations, calculated using a set of learned weights and biases, are combined to generate the final output of the module. The current implementation uses the sum of these output attribute maps as an alternative to concatenating the output feature maps, as shown in Figure 3 below:
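The following PyTorch sketch illustrates the multi-directional scan just described, applying plain GRUs along rows and columns in both directions and summing the resulting output maps; it is an interpretation of the idea for illustration, not the authors' exact implementation:

```python
import torch
import torch.nn as nn

class MDGRUSketch(nn.Module):
    """Rough sketch: scan a (H, W, C) grid with GRUs in four directions and sum."""
    def __init__(self, in_dim, hidden):
        super().__init__()
        self.row_fwd = nn.GRU(in_dim, hidden, batch_first=True)
        self.row_bwd = nn.GRU(in_dim, hidden, batch_first=True)
        self.col_fwd = nn.GRU(in_dim, hidden, batch_first=True)
        self.col_bwd = nn.GRU(in_dim, hidden, batch_first=True)

    def forward(self, x):                                 # x: (B, H, W, C)
        B, H, W, C = x.shape
        rows = x.reshape(B * H, W, C)                     # each row as a sequence
        cols = x.transpose(1, 2).reshape(B * W, H, C)     # each column as a sequence
        r_f, _ = self.row_fwd(rows)
        r_b, _ = self.row_bwd(rows.flip(1))
        r_b = r_b.flip(1)                                 # re-align reversed scan
        c_f, _ = self.col_fwd(cols)
        c_b, _ = self.col_bwd(cols.flip(1))
        c_b = c_b.flip(1)
        hidden = r_f.size(-1)
        r = (r_f + r_b).reshape(B, H, W, hidden)          # horizontal output map
        c = (c_f + c_b).reshape(B, W, H, hidden).transpose(1, 2)  # vertical map
        return r + c                                      # sum of directional maps

# Example: a batch of 2 grids, 6x6 positions, 4 features each.
out = MDGRUSketch(4, 8)(torch.randn(2, 6, 6, 4))
print(out.shape)                                          # torch.Size([2, 6, 6, 8])
```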
A novel hybrid MD-GRU-CNN is proposed, combining the benefits of the MD-GRU for processing time-sequence data with a CNN that handles the high dimensionality of the data. This hybrid architecture is intended to capture the spatial and temporal dependencies in the input data, making it particularly suitable for tasks that call for both short-term and long-term patterns.
Figure 4 illustrates the proposed hybrid multidimensional MD-GRU-CNN neural network design. With spatiotemporal matrices gathered from the power system as inputs and time-based sequential data as outputs, the proposed structure can forecast the load and energy level within a relatively very short amount of time. The MD-GRU module aims to capture long-term dependency: it can learn useful information by scanning each column and row independently in different directions across long-horizon historical data through the memory cell, while the forget gate discards the useless information. The inputs to the MD-GRU module are time-sequence data. The MD-GRU module contains a sizable number of gated recurrent units connected to CNNs that make use of fully connected layers and global average pooling. While the MD-GRU can identify temporal patterns in the data, such as daily or weekly cycles in energy consumption or load, the CNN can identify spatial aspects of the data, such as patterns in the distribution of load or energy consumption across an area. Finally, by calculating the mean value of each neuron in the fully connected layers, the load-prediction results may be determined, as shown in Figure 4 below:
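Together with the MDGRUSketch class from the earlier sketch, a compact hybrid head of this kind might look as follows; the layer sizes are placeholders, not the configuration used in the experiments:

```python
import torch
import torch.nn as nn

class HybridSketch(nn.Module):
    """Illustrative MD-GRU + CNN + GAP hybrid for one-step-ahead load forecasting."""
    def __init__(self, in_dim=4, hidden=8):
        super().__init__()
        self.md_gru = MDGRUSketch(in_dim, hidden)      # temporal patterns (sketch above)
        self.conv = nn.Conv2d(hidden, 16, kernel_size=3, padding=1)  # spatial features
        self.gap = nn.AdaptiveAvgPool2d(1)             # global average pooling
        self.head = nn.Linear(16, 1)                   # predicted load value

    def forward(self, grid):                           # grid: (B, H, W, C)
        h = self.md_gru(grid).permute(0, 3, 1, 2)      # -> (B, hidden, H, W)
        z = torch.relu(self.conv(h))
        return self.head(self.gap(z).flatten(1))      # (B, 1) forecast

pred = HybridSketch()(torch.randn(2, 6, 6, 4))
print(pred.shape)                                      # torch.Size([2, 1])
```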
To evaluate our proposed methodologies, we run the system over the following datasets:
The first dataset used for the experimentation was gathered from the UCI machine learning repository [67]. It contains 10,000 instances with 14 variables relating to the values of electricity producers, the amount produced, the price elasticity factor, the maximal value of the equation root and system reliability; this dataset is referred to as DS-1.
The second dataset, the Almanac of Minutely Power Dataset, contains measurements (natural gas, electricity, and water) taken from a single household of 19 devices over an entire year at a resolution of one (1) minute, then down-sampled to a resolution of thirty (30) minutes [68]; it is referred to as DS-2.
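Down-sampling of this kind is a one-liner in pandas; the series below is synthetic and the column name is hypothetical:

```python
import numpy as np
import pandas as pd

# Hypothetical 1-minute electricity readings for one day.
idx = pd.date_range("2021-01-01", periods=1440, freq="min")
power = pd.Series(np.random.rand(1440), index=idx, name="power_kw")

# Aggregate 1-minute readings into 30-minute means, as done for DS-2.
half_hourly = power.resample("30min").mean()
print(half_hourly.head())
```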
The third dataset, referred to as DS-3, comes from the Smart Grid Smart City initiative started by the Australian government [69], which collected smart-meter data from 78,000 clients.
Multiple metrics, such as accuracy, precision, recall, and F1-score, are broadly applicable and significant for evaluating the performance of different prediction models. Additionally, the mean absolute percentage error (MAPE) and root mean square error (RMSE) are also used:
$\mathrm{MAPE} = \frac{1}{n} \sum_{i=1}^{n} \left| \frac{y(i) - \hat{y}(i)}{y(i)} \right| \times 100\%$,  (5)
$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( \hat{y}(i) - y(i) \right)^2}$,  (6)
where $n$ refers to the number of training samples, and $y(i)$ and $\hat{y}(i)$ are the actual and predicted values, respectively.
The MAPE represents the average ratio between the error and the actual value over the forecasting points, while the RMSE is the sample standard deviation of the differences between predicted and actual values. The model's predictive performance improves as the MAPE and RMSE values decrease.
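Both metrics are straightforward to compute; the following is a direct NumPy transcription of Eqs (5) and (6):

```python
import numpy as np

def mape(y_true, y_pred):
    """Eq (5): mean absolute percentage error."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.mean(np.abs((y_true - y_pred) / y_true)) * 100

def rmse(y_true, y_pred):
    """Eq (6): root mean square error."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    return np.sqrt(np.mean((y_pred - y_true) ** 2))

print(mape([100, 200], [110, 190]))  # 7.5 (%)
print(rmse([100, 200], [110, 190]))  # 10.0
```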
This part begins by presenting the baseline methods, which consist of LSTM, bidirectional LSTM, GRU, bidirectional GRU, and CNN experiments with various time stamps. We then show how we extend these baseline models toward the proposed architecture by combining the multi-directional GRU with a CNN and examining several variants, namely MD-GRU alone and MD-GRU-CNN, as shown in Tables 1 and 2.
Models | Narrative |
CNN | Baseline Methods with Different Time Stamps |
LSTM | |
GRU | |
Bi-LSTM | |
Bi-GRU |
Models | Narrative |
MD-GRU | Proposed Methods with Different Time Stamps |
MD-GRU-CNN |
The purpose of the testing in this work is to show the effectiveness of the recommended procedure across diverse datasets, in combination with the tuning of multiple parameters. During our initial testing, we examined the epochs, batch size, filter windows, MAPE, and learning rate to determine the optimal parameters. However, this work does not go into in-depth detail concerning hyperparameter adjustment.
With a neural network, we can divide a huge dataset into smaller chunks by training with small batches of training cases. Accuracy and the number of layers are sometimes directly associated, implying that as the number of layers increases, so does accuracy. Most of the time this tends to increase precision; however, as the number of layers grows, accuracy can also decline substantially due to the vanishing gradient problem [38]. The computation also depends on how far a single hidden layer is expanded; for instance, if there are not enough hidden layers, performance will suffer. In this work, the dimension of the hidden state is taken to be 128, and one convolutional layer is used with filter sizes of (3, 3, and 5).
After evaluating our model and experimenting with other window filter combinations, we found that seven (7) is the best alternative. Additionally, the batch size is set to 128, the learning rate is set to 0.02, and the number of epochs is kept between 10 and 40 across all datasets. Dropout was fixed at 0.5 for regularization in the baseline models. Following a series of studies, the ReLU activation function was also found to be ideal in the convolutional layer for the best performance. Lastly, a MAPE of 0.95 was found appropriate.
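For reproducibility, the stated settings can be collected in a single configuration object; this sketch mirrors the values reported above, and the optimizer entry is an assumption since it is not specified here:

```python
# Hyperparameters reported above; the optimizer choice (Adam) is an assumption.
config = {
    "hidden_size": 128,
    "conv_filter_sizes": (3, 3, 5),
    "filter_window": 7,
    "batch_size": 128,
    "learning_rate": 0.02,
    "epochs_range": (10, 40),
    "dropout": 0.5,          # baseline models only
    "optimizer": "adam",     # assumption: not specified in the paper
}
print(config)
```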
For the experimentation, a diversity of evaluation techniques is utilized, including accuracy, precision, recall, and F1-score for both the baseline and proposed models, which replace the stacking of multiple convolutional layers as well as the problems associated with bidirectional architectures. Average MAPE and RMSE values were also employed to support the findings. Further, several of the most popular neural network models, including CNN, LSTM, Bi-LSTM, GRU and Bi-GRU, were selected as baseline models to assess the effectiveness and robustness of the proposed MD-GRU-CNN model.
The features of the baseline and proposed MD-GRU-CNN models were acquired by learning from the training datasets. The models were then validated using the test datasets. Tables 3–6 contain the results for the above evaluation metrics on the datasets DS-1, DS-2, and DS-3. Among the baseline models, the highest accuracy, 90.99%, is achieved by the Bi-GRU model on DS-3, as seen in Figure 5. The best precision and recall, also observed for the same model, are 91.86% and 93.18%, respectively, as visualized in Figures 6 and 7, whereas the F1-score was 92.97% using DS-1 with the Bi-GRU model, as depicted in Figure 8.
Accuracy | |||
Models | DS-1 | DS-2 | DS-3 |
CNN | 82.4 | 85.7 | 84.6 |
LSTM | 83.8 | 87.1 | 87.9 |
GRU | 86.99 | 88.12 | 88.71 |
Bi-LSTM | 87.8 | 89.27 | 90 |
Bi-GRU | 89.67 | 92.8 | 90.99 |
Precision
Models | DS-1 | DS-2 | DS-3
CNN | 83.5 | 88.7 | 85.6 |
LSTM | 85.8 | 90.21 | 87.9 |
GRU | 89.77 | 90.97 | 89.45 |
Bi-LSTM | 90 | 89.87 | 90.78 |
Bi-GRU | 91.27 | 89.76 | 91.86 |
Recall | |||
Models | DS-1 | DS-2 | DS-3 |
CNN | 85.3 | 87.6 | 86.5 |
LSTM | 88.2 | 92.1 | 89.2 |
GRU | 87.87 | 91.75 | 90.25 |
Bi-LSTM | 92.8 | 90.71 | 90.02 |
Bi-GRU | 93.17 | 87.06 | 93.18 |
F1-Score | |||
Models | DS-1 | DS-2 | DS-3 |
CNN | 86.2 | 88.6 | 83.5 |
LSTM | 90.2 | 93.4 | 88.4 |
GRU | 89.71 | 92.15 | 91.05 |
Bi-LSTM | 91.99 | 89.84 | 91.72 |
Bi-GRU | 92.97 | 89.62 | 92.08 |
Similarly, the average MAPE attained by the baseline methods is presented in Tables 7–9 for all three datasets (DS-1, DS-2, and DS-3), incorporating different time stamps (2, 4, and 6).
DS-1 (Average MAPE) | |||
Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
CNN | 40.5 | 39.5 | 41.5 |
LSTM | 39.8 | 42.8 | 40.5 |
GRU | 38.7 | 40.71 | 40.99 |
Bi-LSTM | 37.8 | 39.1 | 38.99 |
Bi-GRU | 37.9 | 35.4 | 35.01 |
DS-2 (Average MAPE) | |||
Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
CNN | 39.6 | 41.8 | 41.53 |
LSTM | 38.41 | 40.3 | 38.55 |
GRU | 38.34 | 39.91 | 37.56 |
Bi-LSTM | 37.04 | 38.78 | 36.92 |
Bi-GRU | 35.2 | 36.8 | 35.79 |
DS-3 (Average MAPE) | |||
Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
CNN | 38.6 | 42.4 | 41.03 |
LSTM | 36.97 | 39.47 | 40.89 |
GRU | 37.14 | 39.47 | 37.58 |
Bi-LSTM | 37.04 | 38.14 | 39.87 |
Bi-GRU | 36.2 | 35.49 | 35.17 |
Similarly, among the baseline models, the lowest observed MAPE values are 35.01 for Bi-GRU on DS-1, 35.4 at 4 timestamps on DS-3, and 35.2 at 2 timestamps on DS-2, as seen in Figures 9–11. Considering accuracy, precision, recall and F1-score, Tables 10–13 demonstrate the effectiveness of the proposed models. The proposed models outperform the baselines, achieving a reliable 95.76% accuracy with MD-GRU and 97.85% with MD-GRU-CNN on DS-3, as depicted in Figure 12.
Accuracy | |||
Models | DS-1 | DS-2 | DS-3 |
MD-GRU | 95.21 | 94.77 | 95.76 |
MD-GRU-CNN | 96.17 | 95.89 | 97.85 |
Precision | |||
Models | DS-1 | DS-2 | DS-3 |
MD-GRU | 96.11 | 95.83 | 95.61 |
MD-GRU-CNN | 97.27 | 96.99 | 98.55 |
Recall | |||
Models | DS-1 | DS-2 | DS-3 |
MD-GRU | 98.1 | 98.73 | 96.17 |
MD-GRU-CNN | 96.83 | 98.99 | 98.45 |
F1-Score | |||
Models | DS-1 | DS-2 | DS-3 |
MD-GRU | 97.01 | 97.83 | 95.84 |
MD-GRU-CNN | 97.97 | 97.49 | 98.85 |
Figure 12 illustrates how the proposed models, MD-GRU and MD-GRU-CNN, fared better in terms of precision than the earlier neural network baselines, obtaining 95.83% on DS-2 and 98.55% on DS-3, respectively. On DS-2, the greatest recall rates were 98.73% and 98.99%, as shown in Figures 13 and 14. Figure 15 indicates that the proposed model surpassed the baselines with an F1-score of 98.85% on DS-3.
The proposed models (MD-GRU and MD-GRU-CNN) perform better than the standard CNN, LSTM, Bi-LSTM, GRU, and Bi-GRU architectures (baselines), as demonstrated in the above tables. This is because the datasets used in this research are spatiotemporal. With the proposed hybridization (the fusion of MD-GRU and CNN), there is no loss of related information, which usually occurs when spatiotemporal data are flattened into dimensional, time-sequence data and which leaves recurrent models and their variants unable to learn the features effectively.
Also, the proposed models have the lowest average MAPE values of all the models, as shown in Table 14 and Figure 16. Across the three datasets, our proposed hybrid model can effectively learn both the time-sequence and the spatiotemporal data and extract additional features. For a fuller assessment, the RMSE values of the baseline and proposed models are listed in Table 15 and illustrated in Figure 17. As shown there, although the RMSE values of the Bi-GRU and Bi-LSTM models are nearly identical, the Bi-GRU model surpasses the GRU and Bi-LSTM models. The MD-GRU values fall between those of CNN, GRU, Bi-LSTM and MD-GRU-CNN, revealing that the proposed models consistently surpass the baselines, with MD-GRU performing below MD-GRU-CNN.
DS-1, DS-2, DS-3 (Average MAPE) | |||
Models | 4-Timestamps | 8-Timestamps | 12-Timestamps
MD-GRU (DS-1) | 26.9 | 28.93 | 26.74
MD-GRU (DS-2) | 25.2 | 26.1 | 25.6
MD-GRU (DS-3) | 25.87 | 26.99 | 27.12
MD-GRU-CNN (DS-1) | 25.11 | 25.07 | 26.21
MD-GRU-CNN (DS-2) | 26.17 | 25.18 | 26
MD-GRU-CNN (DS-3) | 25.09 | 25.9 | 26.14
RMSE | |||
Models | DS-1 | DS-2 | DS-3 |
CNN | 2134.45 | 2013.7 | 1966.75 |
LSTM | 1998.5 | 2010.6 | 2034.45 |
GRU | 1962.8 | 1987.44 | 2001.69 |
Bi-LSTM | 1865.7 | 1871.4 | 1885.6 |
Bi-GRU | 1851.4 | 1849.77 | 1880.5 |
MD-GRU | 1598.6 | 1687.5 | 1654.36 |
MD-GRU-CNN | 1418.5 | 1380.5 | 1372.96 |
The results presented in Section 5 demonstrate that the proposed methodology consistently outperforms many previous works. They reveal that the combination of a multi-directional gated recurrent unit (MD-GRU) and a convolutional neural network enhances the effectiveness of deep learning models. Unlike many earlier efforts, which typically report findings on a single dataset, this work examines a single deep learning architecture across diverse datasets from various sources, referred to as DS-1, DS-2 and DS-3. Additionally, it has been established that sequential models with bidirectional behavior and multidirectional abilities dominate traditional sequential models, including GRU and LSTM. When combining several datasets to forecast load and energy, the joint use of MAPE and RMSE is crucial.
Additionally, it has been found that many machine learning approaches are insufficiently equipped to handle categorical variables. In this scenario, the categorical values should always be transformed into numerical values using a label-encoding algorithm. The first contribution of this study is a multi-directional GRU (MD-GRU) that employs temporal GRUs to autonomously analyze every row and column in different positions. At each level, hidden layers are computed and summed together. Two of the four temporal GRUs are employed in the vertical direction and the remaining two in the horizontal direction, yielding the multidirectional GRU depicted in Figure 3. Because of the computational complexity and parameter optimization required to learn long-term dependencies, other variants such as LSTM, Bi-LSTM, GRU, and Bi-GRU are insufficient. We conducted multiple experiments comparing the effectiveness of the proposed models against the baseline models on the same datasets, as shown in the above tables, using multiple evaluation metrics such as MAPE (loss function) and RMSE.
Second, the integration of two neural networks in the multi-directional GRU-CNN model with global average pooling (GAP) proposed in this research improves accuracy. The multidirectional GRU (MD-GRU) module analyzes the time-sequence data to learn long-term dependencies, whereas the CNN focuses on processing the spatiotemporal data. The convolution and global average pooling layers are employed effectively in this study to directly extract local features from the spatiotemporal data and provide an effective representation.
A global average pooling layer replaces the conventional max-pooling layer applied to the output of a convolutional layer, and it also replaces the fully connected layer and dropout, since it acts as a regularizer and reduces the computational complexity. Additionally, it precisely correlates feature maps with classes. Rather than including all of the entries of a particular feature map, we emphasized the average value of each feature map. By separating characteristics from the changeable data that affect the power system, the proposed models quickly and correctly forecast energy and load. The MAPE and RMSE values for the MD-GRU-CNN model are the lowest.
This work has the potential to be expanded in a variety of ways. For each dataset, hyperparameter tuning and optimization can be examined in conjunction with other factors such as weather conditions, climate changes, and holidays. Simultaneously, new techniques beyond MAPE and RMSE are required to train the deep neural network more effectively and to shorten the model's training time. Finally, data augmentation in the time-series domain and advanced learning techniques can help address the issue of data limitation.
Accurate forecasting of energy and load is essential for improving energy systems and their functionality. As a result of digitalization, these systems now produce a large amount of data that is complex, diverse, and spatiotemporal in nature. Most classical and deep learning techniques fall short in learning the long-term dependencies linked to sequential data. We introduced two novel deep learning architectures based on consecutive layers that integrate a multi-directional (vertical and horizontal) GRU with a convolutional neural network employing GAP as hybridization. This fusion uses the MD-GRU to extract feature vectors from time-sequence data, whereas the CNN handles the high-dimensional data. Multiple datasets were used to train and test the models, and several measures, including accuracy, precision, recall and F1-score, were used to assess their performance. Additionally, the MD-GRU-CNN model produced the lowest MAPE and RMSE values compared with the CNN, LSTM, GRU, Bi-LSTM, and Bi-GRU baselines. In the future, similar work can be applied to other diverse applications, such as image dehazing to improve degraded visibility.
This research was funded by Princess Nourah bint Abdulrahman University and Researchers Supporting Project number (PNURSP2023R346), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
All authors declare no conflicts of interest in this paper.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
[1] |
Y. Lu, G. Wang, A load forecasting model based on support vector regression with whale optimization algorithm, Multimed Tools Appl., 82 (2023), 9939–9959. https://doi.org/10.1007/s11042-022-13462-2 doi: 10.1007/s11042-022-13462-2
![]() |
[2] |
H. Habbak, M. Mahmoud, K. Metwally, M. M. Fouda, M. I. Ibrahem, Load forecasting techniques and their applications in smart grids, Energies, 16 (2023), 1480. https://doi.org/10.3390/en16031480 doi: 10.3390/en16031480
![]() |
[3] |
L. Zhang, J. Wen, Y. Li, J. Chen, Y. Ye, Y. Fu, et al., A review of machine learning in building load prediction, Appl. Energy, 285 (2021), 116452. https://doi.org/10.1016/j.apenergy.2021.116452 doi: 10.1016/j.apenergy.2021.116452
![]() |
[4] |
M. Zulfiqar, M. Kamran, M. B. Rasheed, T. Alquthami, A. H. Milyani, A short-term load forecasting model based on self-adaptive momentum factor and wavelet neural network in smart grid, IEEE Access, 10 (2022), 77587–77602. https://doi.org/10.1109/ACCESS.2022.3192433 doi: 10.1109/ACCESS.2022.3192433
![]() |
[5] |
R. Liu, T. Chen, G. Sun, S. M. Muyeen, S. Lin, Y. Mi, Short-term probabilistic building load forecasting based on feature integrated artificial intelligent approach, Electr. Pow. Syst. Res., 206 (2022), 107802. https://doi.org/10.1016/j.epsr.2022.107802 doi: 10.1016/j.epsr.2022.107802
![]() |
[6] |
I. Yazici, O. F Beyca, D. Delen, Deep-learning-based short-term electricity load forecasting: A real case application, Eng. Appl. Artif. Intell., 109 (2022), 104645. https://doi.org/10.1016/j.engappai.2021.104645 doi: 10.1016/j.engappai.2021.104645
![]() |
[7] |
A. Goia, C. May, G. Fusai, Functional clustering and linear regression for peak load forecasting, Int. J. Forecast, 26 (2010), 700–711. https://doi.org/10.1016/j.ijforecast.2009.05.015 doi: 10.1016/j.ijforecast.2009.05.015
![]() |
[8] |
A. H. Nury, K. Hasan, A. M. J. Bin, Comparative study of wavelet-ARIMA and wavelet-ANN models for temperature time series data in northeastern Bangladesh, J. King, Saud. Univ. Sci., 29 (2017), 47–61. https://doi.org/10.1016/j.jksus.2015.12.002 doi: 10.1016/j.jksus.2015.12.002
![]() |
[9] |
G. Y. Chen, M. Gan, G. L. Chen, Generalized exponential autoregressive models for nonlinear time series: Stationarity, estimation and applications, Inf. Sci., 438 (2018), 46–57. https://doi.org/10.1016/j.ins.2018.01.029 doi: 10.1016/j.ins.2018.01.029
![]() |
[10] |
S. Deng, F. Chen, X. Dong, G. Gao, X. Wu, Short-term load forecasting by using improved GEP and abnormal load recognition, ACM Trans. Inter. Technol., 21 (2021), 1–28. https://doi.org/10.1145/3447513 doi: 10.1145/3447513
![]() |
[11] |
J. Lee, Y. Cho, National-scale electricity peak load forecasting: Traditional, machine learning, or hybrid model? Energy, 239 (2022), 122366. https://doi.org/10.1016/j.energy.2021.122366 doi: 10.1016/j.energy.2021.122366
![]() |
[12] |
T. Alquthami, M. Zulfiqar, M. Kamran, A. H. Milyani, M. B. Rasheed, A performance comparison of machine learning algorithms for load forecasting in smart grid, IEEE Access, 10 (2022), 48419–48433. https://doi.org/10.1109/ACCESS.2022.3171270 doi: 10.1109/ACCESS.2022.3171270
![]() |
[13] |
Z. Li, J. Wang, J. Huang, M. Ding, Development and research of triangle-filter convolution neural network for fuel reloading optimization of block-type HTGRs, Appl. Soft Comput., 136 (2023), 110126. https://doi.org/10.1016/j.asoc.2023.110126 doi: 10.1016/j.asoc.2023.110126
![]() |
[14] |
S. Deng, F. Chen, D. Wu, Y. He, H. Ge, Y. Ge, Quantitative combination load forecasting model based on forecasting error optimization, Comput. Elec Engin, 101 (2022), 108125. https://doi.org/10.1016/j.compeleceng.2022.108125 doi: 10.1016/j.compeleceng.2022.108125
![]() |
[15] |
S. Sun, Y. Liu, Q. Li, T. Wang, F. Chu, Short-term multi-step wind power forecasting based on spatio-temporal correlations and transformer neural networks, Energy Convers. Manage., 283 (2023), 116916. https://doi.org/10.1016/j.enconman.2023.116916 doi: 10.1016/j.enconman.2023.116916
![]() |
[16] | Z. Xiao, S. J. Ye, B. Zhong, C. X. Sun, Short term load forecasting using neural network with rough set, Conference: Advances in Neural Networks-ISNN 2006, Third International Symposium on Neural Networks, Chengdu, China, May 28-June 1, 2006, Proceedings, Part Ⅱ. https://doi.org/10.1007/11760023_183 |
[17] | C. X. Li, D. X. Niu, L. M. Meng, Rough set combine BP neural network in next day load curve forcasting, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 5264 LNCS: 2008, 1–10. https://doi.org/10.1007/978-3-540-87734-9_1 |
[18] |
Z. Xiao, S. J. Ye, B. Zhong, C. X. Sun, BP neural network with rough set for short term load forecasting, Expert Syst. Appl., 36 (2009), 273–279. https://doi.org/10.1016/j.eswa.2007.09.031 doi: 10.1016/j.eswa.2007.09.031
![]() |
[19] |
D. Yi, S. Bu, I. Kim, An Enhanced Algorithm of RNN Using Trend in Time-Series, Symmetry, 11 (2019), 912. https://doi.org/10.3390/sym11070912 doi: 10.3390/sym11070912
![]() |
[20] |
V. Kusuma, A. Privadi, A. L. S. Budi, V. L. B. Putri, Photovoltaic Power Forecasting Using Recurrent Neural Network Based on Bayesian Regularization Algorithm. ICPEA 2021-2021 IEEE International Conference in Power Engineering Application, (2021), 109–114. https://doi.org/10.1109/ICPEA51500.2021.9417833 doi: 10.1109/ICPEA51500.2021.9417833
![]() |
[21] |
G. Li, H. Wang, S. Zhang, J. Xin, H. Liu, Recurrent neural networks based photovoltaic power forecasting approach, Energies, 12 (2019), 2538. https://doi.org/10.3390/en12132538 doi: 10.3390/en12132538
![]() |
[22] |
A. Buonanno, M. Caliano, A. Pontecorvo, G. Sforza, M. Valenti, G. Graditi, Global vs. local models for short‐term electricity demand prediction in a Residential/Lodging scenario, Energies, 15 (2022), 2037. https://doi.org/10.3390/en15062037 doi: 10.3390/en15062037
![]() |
[23] |
R. Quan, Z. Li, P. Liu, Y. Li, Y. Chang, H. Yan, Minimum hydrogen consumption-based energy management strategy for hybrid fuel cell unmanned aerial vehicles using direction prediction optimal foraging algorithm, Fuel Cells, 23 (2023), 221–236. https://doi.org/10.1002/fuce.202200121 doi: 10.1002/fuce.202200121
![]() |
[24] |
S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural Comput., 9 (1997), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735 doi: 10.1162/neco.1997.9.8.1735
![]() |
[25] | J. Chung, C. Gulcehre, K. H. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, NIPS 2014 Deep Learning and Representation Learning Workshop, 2014. https://doi.org/10.48550/arXiv.1412.3555 |
[26] |
A. K. Tyagi, N. Sreenath, Cyber physical systems: Analyses, challenges and possible solutions, Int. Thing. Cyber-Physical Syst., 1 (2021), 22–33. https://doi.org/10.1016/j.iotcps.2021.12.002 doi: 10.1016/j.iotcps.2021.12.002
![]() |
[27] |
J. Moon, S. Park, S. Rho, E. Hwang, A comparative analysis of artificial neural network architectures for building energy consumption forecasting, Int. J. Distrib. Sens. N., 15 (2019). https://doi.org/10.1177/1550147719877616 doi: 10.1177/1550147719877616
![]() |
[28] |
T. Walser, A. Sauer, Typical load profile-supported convolutional neural network for short-term load forecasting in the industrial sector, Energy AI., 5 (2021), 100104. https://doi.org/10.1016/j.egyai.2021.100104 doi: 10.1016/j.egyai.2021.100104
![]() |
[29] | X. Ke, L. Shi, W. Guo, D. Chen, Multi-Dimensional traffic congestion detection based on fusion of visual features and convolutional neural network, IEEE T. Intell. Transp., 20 (2019), 2157–2170. http://www.ieee.org/publications_standards/publications/rights/index.html |
[30] |
P. H. Kuo, C. J. Huang, A high precision artificial neural networks model for short-term energy load forecasting, Energies, 11 (2018) 213. https://doi.org/10.3390/en11010213 doi: 10.3390/en11010213
![]() |
[31] |
J. Walther, D. Spanier, N. Panten, E. Abele, Very short-term load forecasting on factory level—A machine learning approach, Procedia CIRP, 80 (2019), 705–710. https://doi.org/10.1016/j.procir.2019.01.060 doi: 10.1016/j.procir.2019.01.060
![]() |
[32] |
T. Hong, J. Wilson, J. Xie, Long term probabilistic load forecasting and normalization with hourly information, IEEE T. Smart Grid, 5 (2014), 456–462. https://doi.org/10.1109/TSG.2013.2274373 doi: 10.1109/TSG.2013.2274373
![]() |
[33] |
B. M. Hodge, D. Lew, M. Milligan, Short-term load forecast error distributions and implications for renewable integration studies, IEEE Green Technologies Conference, (2013), 435–442. https://doi.org/10.1109/GreenTech.2013.73 doi: 10.1109/GreenTech.2013.73
![]() |
[34] | H. M. Al-Hamadi, S. A. Soliman, Long-term/mid-term electric load forecasting based on short-term correlation and annual growth, Electr. Pow. Syst. Res., 74 (2005), 353–361. |
[35] |
X. Sun, Z. Ouyang, D. Yue, Short-term load forecasting model based on multi-label and BPNN. Comm. Comp. Infor. Sci., 761 (2017), 263–272. https://doi.org/10.1007/978-981-10-6370-1_26 doi: 10.1007/978-981-10-6370-1_26
![]() |
[36] |
W. Tang, F. He, Y. Liu, YDTR: Infrared and visible image fusion via Y-shape dynamic transformer, IEEE T. Multimedia, (2022), 1–16. https://doi.org/10.1109/TMM.2022.3192661 doi: 10.1109/TMM.2022.3192661
![]() |
[37] |
A. A. Peñaloza, R. C. Leborgne, A. Balbinot, Comparative analysis of residential load forecasting with different levels of aggregation, Eng. Proc, 18 (2022), 29. https://doi.org/10.3390/engproc2022018029 doi: 10.3390/engproc2022018029
![]() |
[38] |
T. Bashir, C. Haoyong, M. F. Tahir, Z. Liqiang, Short term electricity load forecasting using hybrid prophet-LSTM model optimized by BPNN, Energy Rep., 8 (2022), 1678–1686. https://doi.org/10.1016/j.egyr.2021.12.067 doi: 10.1016/j.egyr.2021.12.067
![]() |
[39] |
Y. Song, F. He, Y. Duan, Y. Liang, X. Yan, A kernel correlation-based approach to adaptively acquire local features for learning 3D point clouds, Comput. Aided Design, 146 (2022), 103196. https://doi.org/10.1016/j.cad.2022.103196 doi: 10.1016/j.cad.2022.103196
![]() |
[40] |
A. H. Nury, K. Hasan, M. J. B. Alam, Comparative study of wavelet-ARIMA and wavelet-ANN models for temperature time series data in northeastern Bangladesh, J. King Saud. Univ. Sci., 29 (2017), 47–61. https://doi.org/10.1016/j.jksus.2015.12.002 doi: 10.1016/j.jksus.2015.12.002
![]() |
[41] |
C. M. Lee, C. N. Ko, Short-term load forecasting using lifting scheme and ARIMA models, Expert Syst. Appl., 38 (2011), 5902–5911. https://doi.org/10.1016/j.eswa.2010.11.033 doi: 10.1016/j.eswa.2010.11.033
![]() |
[42] |
A. Baliyan, K. Gaurav, S. K. Mishra, A Review of short term load forecasting using artificial neural network models, Procedia Comput. Sci., 48 (2015), 121–125. https://doi.org/10.1016/j.procs.2015.04.160 doi: 10.1016/j.procs.2015.04.160
![]() |
[43] |
J. P. Liu, C. L. Li, The short-term power load forecasting based on sperm whale algorithm and wavelet least square support vector machine with DWT-IR for feature selection, Sustainability, 9 (2017), 1188. https://doi.org/10.3390/su9071188 doi: 10.3390/su9071188
![]() |
[44] |
A. Jadidi, R. Menezes, N. D. Souza, A. C. D. C. Lima, Energies, E. Sciubba, Short-term electric power demand forecasting using NSGA Ⅱ-ANFIS model, Energies, 12 (2019), 1891. https://doi.org/10.3390/en12101891 doi: 10.3390/en12101891
![]() |
[45] |
J. Zhang, F. He, Y. Duan, Y. Duan, S. Yang, AIDEDNet: Anti-interference and detail enhancement dehazing network for real-world scenes, Front Comput. Sci., 17 (2023), 1–11. https://doi.org/10.1007/s11704-022-1523-9 doi: 10.1007/s11704-022-1523-9
![]() |
[46] |
S. Zhang, F. He, DRCDN: learning deep residual convolutional dehazing networks, Visual Comput., 36 (2020), 1797–1808. https://doi.org/10.1007/s00371-019-01774-8 doi: 10.1007/s00371-019-01774-8
![]() |
[47] |
D. Niu, Y. Wang, D. D. Wu, Power load forecasting using support vector machine and ant colony optimization, Expert Syst. Appl., 37 (2010), 2531–2539. https://doi.org/10.1016/j.eswa.2009.08.019 doi: 10.1016/j.eswa.2009.08.019
![]() |
[48] |
H. H. Çevik, M. Çunkaş, Short-term load forecasting using fuzzy logic and ANFIS, Neural Comput. Appl., 26 (2015), 1355–1367. https://doi.org/10.1007/s00521-014-1809-4 doi: 10.1007/s00521-014-1809-4
![]() |
[49] |
G. Li, H. Wang, S. Zhang, J. Xin, H. Liu, Recurrent neural networks based photovoltaic power forecasting approach, Energies, 12 (2019), 2538. https://doi.org/10.3390/en12132538 doi: 10.3390/en12132538
![]() |
[50] |
X. Xiong, P. Zhou, C. Ailian, Asymptotic normality of the local linear estimation of the conditional density for functional time-series data, Commum, Statis. Theory Meth., 47 (2017), 3418–3440. https://doi.org/10.1080/03610926.2017.1359292 doi: 10.1080/03610926.2017.1359292
![]() |
[51] | Deep Learning, Available from: https://mitpress.mit.edu/9780262035613/deep-learning/. |
[52] |
N. Ahmad, Y. Ghadi, M. Adnan, M. Ali, Load forecasting techniques for power system: Research challenges and survey, IEEE Access, 10 (2022), 71054–71090. https://doi.org/10.1109/ACCESS.2022.3187839 doi: 10.1109/ACCESS.2022.3187839
![]() |
[53] |
A. S. Santra, J. L. Lin, Integrating long short-term memory and genetic algorithm for short-term load forecasting, Energies, 12 (2019), 2040. https://doi.org/10.3390/en12112040 doi: 10.3390/en12112040
![]() |
[54] |
W. Li, T. Logenthiran, W. L Woo, Multi-GRU prediction system for electricity generation's planning and operation, IET Gener. Transm. Dis., 13 (2019), 1630–1637. https://doi.org/10.1049/iet-gtd.2018.6081 doi: 10.1049/iet-gtd.2018.6081
![]() |
[55] |
X. Gao, X. Li, B. Zhao, W. Ji, X. Jing, Y. He, Short-term electricity load forecasting model based on EMD-GRU with feature selection, Energies, 12 (2019), 1140. https://doi.org/10.3390/en12061140 doi: 10.3390/en12061140
![]() |
[56] | T. Mikolov, M. Karafiát, L. Burget, J. H. Cernocky, S. Khudanpur, Recurrent neural network based language model, Conference: INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 26–30, 2010 |
[57] |
H. Salehinejad, S. Sankar, J. Barfett, E. Colak, S. Valaee, Recent advances in recurrent neural networks, Neural Evolu. Comput., (2018), 1–21. https://doi.org/10.48550/arXiv.1801.01078 doi: 10.48550/arXiv.1801.01078
![]() |
[58] |
M. Schuster, K. K. Paliwal, Bidirectional recurrent neural networks, IEEE T. Signal Proces., 45 (1997), 2673–2681. https://doi.org/10.1109/78.650093 doi: 10.1109/78.650093
![]() |
[59] | T. Mikolov, S. Kombrink, L. Burget, J. Černocký, S. Khudanpur, Extensions of recurrent neural network language model, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, (2011), 5528–5531. https://doi.org/10.1109/ICASSP.2011.5947611 |
[60] |
A. G. Ororbia, T. Mikolov, D. Reitter, Learning simpler language models with the differential state framework, Neural Comput., 29 (2017), 3327–3352. https://doi.org/10.1162/neco_a_01017 doi: 10.1162/neco_a_01017
![]() |
[61] |
S. Hochreiter, The vanishing gradient problem during learning recurrent neural nets and problem solutions, Int. J. Uncertain Fuzz., 6 (1998), 107–116. https://doi.org/10.1142/S0218488598000094 doi: 10.1142/S0218488598000094
![]() |
[62] | B. Y. Lin, F. F. Xu, Z. Luo, K. Zhu, Multi-channel BiLSTM-CRF model for emerging named entity recognition in social media, Proceedings of the 3rd Workshop on Noisy User-generated Text, Stroudsburg, PA, USA, Association for Computational Linguistics, (2018), 160–165. https://doi.org/10.18653/v1/W17-4421 |
[63] | C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, (2015), 1–9. https://doi.org/10.1109/CVPR.2015.7298594 |
[64] |
K. He, J. Sun, Convolutional neural networks at constrained time cost, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (2015), 5353–5360. https://doi.org/10.48550/arXiv.1412.1710 doi: 10.48550/arXiv.1412.1710
![]() |
[65] |
F. Abid, M. Alam, M. Yasir, C. Li, Sentiment analysis through recurrent variants latterly on convolutional neural network of Twitter, Future Generation Computer Systems, 95 (2019), 292–308. https://doi.org/10.1016/j.future.2018.12.018 doi: 10.1016/j.future.2018.12.018
![]() |
[66] | S. Wang, J. Jiang, Learning natural language inference with LSTM, 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016-Proceedings of the Conference, (2016), 1442–1451. https://doi.org/10.18653/v1/N16-1170 |
[67] N. F. F. da Silva, E. R. Hruschka, E. R. Hruschka Jr., Tweet sentiment analysis with classifier ensembles, Decis. Support Syst., 66 (2014), 170–179. https://doi.org/10.1016/j.dss.2014.07.003
[68] S. Makonin, F. Popowich, L. Bartram, B. Gill, I. V. Bajić, AMPds: A public dataset for load disaggregation and eco-feedback research, 2013 IEEE Electrical Power & Energy Conference, Halifax, NS, Canada, (2013), 1–6. https://doi.org/10.1109/EPEC.2013.6802949
[69] Smart-Grid Smart-City Customer Trial Data, Datasets, data.gov.au. Available from: https://data.gov.au/dataset/ds-dga-4e21dea3-9b87-4610-94c7-15a8a77907ef/details
This article has been cited by:

1. Md Babul Islam, Antonio Guerrieri, Raffaele Gravina, Giancarlo Fortino, A Meta-Survey on Intelligent Energy-Efficient Buildings, 2024, 8, 2504-2289, 83. https://doi.org/10.3390/bdcc8080083
2. Chenmin Ni, Muhammad Fadhil Marsani, Fam Pei Shan, Xiaopeng Zou, Flood prediction with optimized gated recurrent unit-temporal convolutional network and improved KDE error estimation, 2024, 9, 2473-6988, 14681. https://doi.org/10.3934/math.2024714
3. Francesca Villano, Gerardo Maria Mauro, Alessia Pedace, A Review on Machine/Deep Learning Techniques Applied to Building Energy Simulation, Optimization and Management, 2024, 4, 2673-7264, 100. https://doi.org/10.3390/thermo4010008
4. Shuguang Li, Kashif Ali, Salem Algarni, Talal Alqahtani, Sohail Ahmad, Fayza Abdel Aziz ElSeabee, Hameed Ullah, Wasim Jamshed, Kashif Irshad, Numerical analysis of thermophoretic particle deposition in a magneto-Marangoni convective dusty tangent hyperbolic nanofluid flow – Thermal and magnetic features, 2024, 13, 2191-9097. https://doi.org/10.1515/ntrev-2023-0190
5. Salvatore Mancha Gonzales, Hasnain Iftikhar, Javier Linkolk López-Gonzales, Analysis and forecasting of electricity prices using an improved time series ensemble approach: an application to the Peruvian electricity market, 2024, 9, 2473-6988, 21952. https://doi.org/10.3934/math.20241067
6. Zheng Grace Ma, Nicolai Bo Vanting, Bo Nørregaard Jørgensen, Residential Heating Demand Prediction Using Time-Series Clustering and Deep Learning, 2024, 979-8-3315-1013-8, 1. https://doi.org/10.1109/ICSAI65059.2024.10893794
7. Xiaojun Wang, Linrong Wang, Nan Liang, Machine learning-driven power demand forecasting models for optimized power management, 2025, 0948-7921. https://doi.org/10.1007/s00202-025-03140-5
8. Chingfei Luo, Chenzi Liu, Chen Huang, Meilan Qiu, Dewang Li, ARIMA Markov Model and Its Application of China's Total Energy Consumption, 2025, 18, 1996-1073, 2914. https://doi.org/10.3390/en18112914
9. Simeng Zhang, Houqing Zhang, Yangcheng Hu, Statistical Forecasting of Short-term Electricity Load Using a Meteorological Feature-driven Deep Learning Fusion Framework, 2025, 979-8-3315-2184-4, 1067. https://doi.org/10.1109/CEEPE64987.2025.11034044
| Models | Narrative |
|---|---|
| CNN | Baseline methods with different timestamps |
| LSTM | |
| GRU | |
| Bi-LSTM | |
| Bi-GRU | |

| Models | Narrative |
|---|---|
| MD-GRU | Proposed methods with different timestamps |
| MD-GRU-CNN | |
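For orientation, the baseline forecasters listed above follow standard recurrent architectures. The sketch below is a minimal Keras illustration under assumed hyperparameters (`build_baseline`, the 64 hidden units, and the one-step dense head are our choices for demonstration, not the paper's configuration); the CNN baseline would analogously replace the recurrent layer with `Conv1D` filters and pooling.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_baseline(cell: str, timesteps: int = 6, n_features: int = 1,
                   units: int = 64, bidirectional: bool = False):
    """Illustrative LSTM/GRU (or Bi-LSTM/Bi-GRU) load forecaster.

    All hyperparameters here are assumptions for demonstration,
    not the configuration used in the paper.
    """
    rnn = {"lstm": layers.LSTM, "gru": layers.GRU}[cell](units)
    if bidirectional:
        rnn = layers.Bidirectional(rnn)  # read the input window in both directions
    model = models.Sequential([
        layers.Input(shape=(timesteps, n_features)),  # sliding window of past load
        rnn,
        layers.Dense(1),  # one-step-ahead forecast
    ])
    model.compile(optimizer="adam", loss="mse")
    return model

# e.g., the Bi-GRU baseline over a 6-timestamp input window:
bi_gru = build_baseline("gru", timesteps=6, bidirectional=True)
```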
Accuracy (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| CNN | 82.4 | 85.7 | 84.6 |
| LSTM | 83.8 | 87.1 | 87.9 |
| GRU | 86.99 | 88.12 | 88.71 |
| Bi-LSTM | 87.8 | 89.27 | 90 |
| Bi-GRU | 89.67 | 92.8 | 90.99 |
Precision (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| CNN | 83.5 | 88.7 | 85.6 |
| LSTM | 85.8 | 90.21 | 87.9 |
| GRU | 89.77 | 90.97 | 89.45 |
| Bi-LSTM | 90 | 89.87 | 90.78 |
| Bi-GRU | 91.27 | 89.76 | 91.86 |
Recall (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| CNN | 85.3 | 87.6 | 86.5 |
| LSTM | 88.2 | 92.1 | 89.2 |
| GRU | 87.87 | 91.75 | 90.25 |
| Bi-LSTM | 92.8 | 90.71 | 90.02 |
| Bi-GRU | 93.17 | 87.06 | 93.18 |

F1-Score (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| CNN | 86.2 | 88.6 | 83.5 |
| LSTM | 90.2 | 93.4 | 88.4 |
| GRU | 89.71 | 92.15 | 91.05 |
| Bi-LSTM | 91.99 | 89.84 | 91.72 |
| Bi-GRU | 92.97 | 89.62 | 92.08 |
DS-1 (Average MAPE, %)

| Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
|---|---|---|---|
| CNN | 40.5 | 39.5 | 41.5 |
| LSTM | 39.8 | 42.8 | 40.5 |
| GRU | 38.7 | 40.71 | 40.99 |
| Bi-LSTM | 37.8 | 39.1 | 38.99 |
| Bi-GRU | 37.9 | 35.4 | 35.01 |

DS-2 (Average MAPE, %)

| Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
|---|---|---|---|
| CNN | 39.6 | 41.8 | 41.53 |
| LSTM | 38.41 | 40.3 | 38.55 |
| GRU | 38.34 | 39.91 | 37.56 |
| Bi-LSTM | 37.04 | 38.78 | 36.92 |
| Bi-GRU | 35.2 | 36.8 | 35.79 |

DS-3 (Average MAPE, %)

| Models | 2-Timestamps | 4-Timestamps | 6-Timestamps |
|---|---|---|---|
| CNN | 38.6 | 42.4 | 41.03 |
| LSTM | 36.97 | 39.47 | 40.89 |
| GRU | 37.14 | 39.47 | 37.58 |
| Bi-LSTM | 37.04 | 38.14 | 39.87 |
| Bi-GRU | 36.2 | 35.49 | 35.17 |
Accuracy (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| MD-GRU | 95.21 | 94.77 | 95.76 |
| MD-GRU-CNN | 96.17 | 95.89 | 97.85 |

Precision (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| MD-GRU | 96.11 | 95.83 | 95.61 |
| MD-GRU-CNN | 97.27 | 96.99 | 98.55 |

Recall (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| MD-GRU | 98.1 | 98.73 | 96.17 |
| MD-GRU-CNN | 96.83 | 98.99 | 98.45 |

F1-Score (%)

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| MD-GRU | 97.01 | 97.83 | 95.84 |
| MD-GRU-CNN | 97.97 | 97.49 | 98.85 |
DS-1, DS-2, DS-3 (Average MAPE, %)

| Models | Dataset | 4-Timestamps | 8-Timestamps | 12-Timestamps |
|---|---|---|---|---|
| MD-GRU | DS-1 | 26.9 | 28.93 | 26.74 |
| | DS-2 | 25.2 | 26.1 | 25.6 |
| | DS-3 | 25.87 | 26.99 | 27.12 |
| MD-GRU-CNN | DS-1 | 25.11 | 25.07 | 26.21 |
| | DS-2 | 26.17 | 25.18 | 26 |
| | DS-3 | 25.09 | 25.9 | 26.14 |
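Architecturally, the proposed hybrid routes recurrent sequence features through convolutional filters and condenses them with global average pooling (GAP) before the forecasting head. The multi-directional GRU is specific to this work, so the sketch below substitutes a bidirectional GRU and assumed layer widths; read it as an approximation of the pipeline, not the authors' MD-GRU-CNN implementation.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_gru_cnn_gap(timesteps: int = 12, n_features: int = 1):
    """Approximate GRU -> CNN -> GAP hybrid (illustrative only)."""
    inp = layers.Input(shape=(timesteps, n_features))
    # A Bi-GRU stands in for the paper's multi-directional GRU.
    x = layers.Bidirectional(layers.GRU(64, return_sequences=True))(inp)
    # 1-D convolutions pick up local patterns in the recurrent features.
    x = layers.Conv1D(32, kernel_size=3, padding="same", activation="relu")(x)
    # GAP collapses the time axis without a heavy flatten/dense block.
    x = layers.GlobalAveragePooling1D()(x)
    out = layers.Dense(1)(x)  # load forecast
    model = models.Model(inp, out)
    model.compile(optimizer="adam", loss="mse")
    return model
```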
RMSE

| Models | DS-1 | DS-2 | DS-3 |
|---|---|---|---|
| CNN | 2134.45 | 2013.7 | 1966.75 |
| LSTM | 1998.5 | 2010.6 | 2034.45 |
| GRU | 1962.8 | 1987.44 | 2001.69 |
| Bi-LSTM | 1865.7 | 1871.4 | 1885.6 |
| Bi-GRU | 1851.4 | 1849.77 | 1880.5 |
| MD-GRU | 1598.6 | 1687.5 | 1654.36 |
| MD-GRU-CNN | 1418.5 | 1380.5 | 1372.96 |
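For reference, the error metrics tabulated above follow their standard definitions, $\mathrm{MAPE} = \frac{100}{n}\sum_{t=1}^{n}\left|\frac{y_t-\hat{y}_t}{y_t}\right|$ and $\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{t=1}^{n}(y_t-\hat{y}_t)^2}$. A minimal NumPy sketch of these definitions (ours, not the authors' evaluation script) is:

```python
import numpy as np

def mape(y_true, y_pred):
    """Mean Absolute Percentage Error in percent (assumes y_true has no zeros)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return 100.0 * np.mean(np.abs((y_true - y_pred) / y_true))

def rmse(y_true, y_pred):
    """Root Mean Square Error, in the units of the load series."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))
```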