COVID-19 pandemic is spreading around the world becoming thus a serious concern for health, economic and social systems worldwide. In such situation, predicting as accurately as possible the future dynamics of the virus is a challenging problem for scientists and decision-makers. In this paper, four phenomenological epidemic models as well as Suspected-Infected-Recovered (SIR) model are investigated for predicting the cumulative number of infected cases in Saudi Arabia in addition to the probable end-date of the outbreak. The prediction problem is formulated as an optimization framework and solved using a Particle Swarm Optimization (PSO) algorithm. The Generalized Richards Model (GRM) has been found to be the best one in achieving two objectives: first, fitting the collected data (covering 223 days between March 2nd and October 10, 2020) with the lowest mean absolute percentage error (MAPE = 3.2889%), the highest coefficient of determination (R2 = 0.9953) and the lowest root mean squared error (RMSE = 8827); and second, predicting a probable end date found to be around the end of December 2020 with a projected number of 378,299 at the end of the outbreak. The obtained results may help the decision-makers to take suitable decisions related to the pandemic mitigation and containment and provide clear understanding of the virus dynamics in Saudi Arabia.
Citation: Rafat Zreiq, Souad Kamel, Sahbi Boubaker, Asma A Al-Shammary, Fahad D Algahtani, Fares Alshammari. Generalized Richards model for predicting COVID-19 dynamics in Saudi Arabia based on particle swarm optimization Algorithm[J]. AIMS Public Health, 2020, 7(4): 828-843. doi: 10.3934/publichealth.2020064
Related Papers:
[1]
Aaron C Shang, Kristen E Galow, Gary G Galow .
Regional forecasting of COVID-19 caseload by non-parametric regression: a VAR epidemiological model. AIMS Public Health, 2021, 8(1): 124-136.
doi: 10.3934/publichealth.2021010
[2]
Sushant K Singh .
COVID-19: A master stroke of Nature. AIMS Public Health, 2020, 7(2): 393-402.
doi: 10.3934/publichealth.2020033
[3]
Saina Abolmaali, Samira Shirzaei .
A comparative study of SIR Model, Linear Regression, Logistic Function and ARIMA Model for forecasting COVID-19 cases. AIMS Public Health, 2021, 8(4): 598-613.
doi: 10.3934/publichealth.2021048
[4]
Yosef Mohamed-Azzam Zakout, Fayez Saud Alreshidi, Ruba Mustafa Elsaid, Hussain Gadelkarim Ahmed .
The magnitude of COVID-19 related stress, anxiety and depression associated with intense mass media coverage in Saudi Arabia. AIMS Public Health, 2020, 7(3): 664-678.
doi: 10.3934/publichealth.2020052
[5]
Christopher Kitchen, Elham Hatef, Hsien Yen Chang, Jonathan P Weiner, Hadi Kharrazi .
Assessing the association between area deprivation index on COVID-19 prevalence: a contrast between rural and urban U.S. jurisdictions. AIMS Public Health, 2021, 8(3): 519-530.
doi: 10.3934/publichealth.2021042
[6]
Kathleen Lois Foster, Alessandro Maria Selvitella .
On the relationship between COVID-19 reported fatalities early in the pandemic and national socio-economic status predating the pandemic. AIMS Public Health, 2021, 8(3): 439-455.
doi: 10.3934/publichealth.2021034
[7]
Ali Roghani .
The relationship between macro-socioeconomics determinants and COVID-19 vaccine distribution. AIMS Public Health, 2021, 8(4): 655-664.
doi: 10.3934/publichealth.2021052
[8]
Chun Nok Lam, William Nicholas, Alejandro De La Torre, Yanpui Chan, Jennifer B. Unger, Neeraj Sood, Howard Hu .
Factors associated with parents' willingness to vaccinate their children against COVID-19: The LA pandemic surveillance cohort study. AIMS Public Health, 2022, 9(3): 482-489.
doi: 10.3934/publichealth.2022033
[9]
Musyoka Kinyili, Justin B Munyakazi, Abdulaziz YA Mukhtar .
Mathematical modeling and impact analysis of the use of COVID Alert SA app. AIMS Public Health, 2022, 9(1): 106-128.
doi: 10.3934/publichealth.2022009
[10]
Ali Moussaoui, El Hadi Zerga .
Transmission dynamics of COVID-19 in Algeria: The impact of physical distancing and face masks. AIMS Public Health, 2020, 7(4): 816-827.
doi: 10.3934/publichealth.2020063
Abstract
COVID-19 pandemic is spreading around the world becoming thus a serious concern for health, economic and social systems worldwide. In such situation, predicting as accurately as possible the future dynamics of the virus is a challenging problem for scientists and decision-makers. In this paper, four phenomenological epidemic models as well as Suspected-Infected-Recovered (SIR) model are investigated for predicting the cumulative number of infected cases in Saudi Arabia in addition to the probable end-date of the outbreak. The prediction problem is formulated as an optimization framework and solved using a Particle Swarm Optimization (PSO) algorithm. The Generalized Richards Model (GRM) has been found to be the best one in achieving two objectives: first, fitting the collected data (covering 223 days between March 2nd and October 10, 2020) with the lowest mean absolute percentage error (MAPE = 3.2889%), the highest coefficient of determination (R2 = 0.9953) and the lowest root mean squared error (RMSE = 8827); and second, predicting a probable end date found to be around the end of December 2020 with a projected number of 378,299 at the end of the outbreak. The obtained results may help the decision-makers to take suitable decisions related to the pandemic mitigation and containment and provide clear understanding of the virus dynamics in Saudi Arabia.
1.
Introduction
In early 2019, the COVID-19 disease has started spreading from Wuhan city, Hubei province in China. Few days later, it quickly spread across the world infecting 39,170,503 persons and causing the death of 1,102,926 as of October 16, 2020. Although 29,378,739 individuals have recovered/discharged, 8,688,838 are reported as still active cases [1]. Several actions have been implemented by different countries ranging from total lockdown to social distancing measures in order to contain the virus and limit its spread [2]. Moreover, economic activities and health systems around the globe have been extremely affected which made the world in front of an unprecedented difficult situation.
Large attention is being paid by scientists from different backgrounds since they are hardly working on COVID-19 various aspects including the virus dynamics modeling. Several researches have been published aiming to predict the number of infected cases, the number of deaths and more particularly a probable end date in different countries and sometimes in different cities (provinces) inside the same country. As examples, we can cite references [3] for Saudi Arabia, [4] for Kuwait and [5] for India. The reported results have shown varying degrees of accuracy and reliability due to several causes including the quality and quantity of available information about the virus [6]. Suspected-Infected-Recovered (SIR) model as well as its variants have been used to predict the virus dynamics [2],[4],[7]. Although these models are continuous-time and the data of reported cases are available in discrete-time frequency (daily), several studies have considered SIR based on discrete-time frameworks [8]–[10]. The effect of containment and control measures has been also considered by using appropriate values of the model parameters. Moreover, time-series models such as autoregressive moving average (ARIMA) [11]–[12] and its variants in addition to different artificial intelligence (AI) based models are facing luck of sufficient information both quantitatively and qualitatively [5]–[6],[10],[12].
The logistic growth model and many of its variants have been utilized for predicting COVID-19 future in different countries. For example, the study in [3] have developed simultaneously SIR and logistic growth models for the case study of Saudi Arabia using data between March 2nd and May 15th, 2020. Unfortunately, by checking back their forecasted cumulative infected numbers and projected end date, it has been found that their projections were inaccurate since they expected a number ranging between 69,000 and 79,000 at the end of June 2020 as probable end date (the real number was 190,823 in June 30, 2020) [13]. The case study of Kuwait using logistic regression models has been examined in [4]. The predicted and real total infected numbers are checked and found to be different at the projected end date (predicted: 4,100, real: 17,700) [14]. Five-parameter logistic growth model has been used for reconstructing COVID-19 data in USA [15]. Results showed a good fit and a relatively accurate estimation of new infected cases as for April 4th, 2020 but unfortunately the developed model has failed in predicting the end date since the virus continues its increase till today (October 16, 2020). The case study of India has been also investigated using logistic growth model in [16]. The developed model has been found to over-estimate the total number of cumulative infected cases by May 22, 2020 (predicted: 1 million, reported: 124,000). The drawback of the logistic growth model is that it is suitable only for modeling early stages of epidemics [17]. Combined models have been also considered for predicting COVID-19 [18]–[20]. A combination of logistic model with machine learning [18], ANFIS with virus optimization algorithm [19] and Gaussian mixture model are examples of such frameworks reported to be efficient. As a conclusion, it is clearly observed that most of the developed models succeeded in fitting the historical data but faced difficulties in predicting the number of infected cases in the coming future.
Until now, based on the above discussed references and to the best of the authors' knowledge, few researches have used phenomenological models for COVID-19 forecasting. Moreover, few studies that considered using Richards model combined with a global optimization swarm intelligence technique (the Particle Swarm Optimization (PSO)) for predicting COVID-19 dynamics are available. For this aim, this study will focus on implementing phenomenological models, namely, generalized growth model (GGM), generalized logistic model (GLM), classical logistic growing model (CLGM) and generalized Richards model (GRM) to predict the dynamics of COVID-19 dynamics in Saudi Arabia. SIR model will be also implemented on the same dataset for comparison purpose. The main motivations behind this choice are summarized as follows: (i) this class of models is known to be robust, (ii) they include few parameters to be calibrated optimally and (iii) COVID-19 (like any other epidemic) should pass through an exponential growing phase at the beginning, should reach a peak and should exhibit a flat and stable curve which is well-captured by both CLGM and GRM. In this study, the identification of the models' parameters is defined as a minimization problem. The quadratic error between the cumulative infected cases and the reported infected cases is minimized by respect to the model parameters. The identification problem is then solved using the global PSO technique. The uniqueness and the stability of the developed models will also be investigated.
The remaining of this paper is organized as follows. Section 2 presents the models investigated in this study as well as details about the computational methodology used to identify the models' parameters. In Section 3, the results of the conducted experiments using Saudi Arabia case study are provided. The obtained results will be discussed in Section 4. Finally, conclusion and recommendations are provided in Section 5.
2.
Materials and methods
In this section, the four phenomenological models as well as the SIR compartmental model used to forecast the dynamics of COVID-19 in short-term horizon are presented and discussed in detail [21]–[23],[33],[34].
2.1. Generalized growth model (GGM)
The generalized growth model is described in Eq 1 below:
dC(t)dt=rC(t)p
This model is usually used in early stages of an epidemic spread. It relies on two parameters; namely the intrinsic growth rate, r and the scaling growth rate, p. According to the value of p, three growth profiles can occur: a straight linear behavior (p = 0), an exponential growth (p = 1) and a sub-exponential growth (p < 1). In a model identification framework, the parameter p search space is set to be the interval [0–1]. C(t) denotes the cumulative number of infected cases and dC(t)dt is its derivative with respect to the time, t.
2.2. Classical logistic growth model (CLGM)
The classical logistic growth model is described in Eq 2.
dC(t)dt=rC(t)(1−C(t)K)
The model parameters are the same as in the GGM. This model is expected to capture both the early stage exponential curve and a steady state (flat curve) reached at the end of the epidemic. During the late stages of the virus, the number of new cases becomes small and the number of cumulative cases becomes constant equal to the final capacity, K.
2.3. Generalized logistic model (GLM)
The generalized logistic model is described in Eq 3 below:
dC(t)dt=rC(t)p(1−C(t)K)
This model is similar to the CLGM except of the inclusion of a scaling growth rate, p.
2.4. Generalized Richards model (GRM)
The generalized Richards model is described in Eq 4 below:
dC(t)dt=rC(t)p(1−(C(t)K)α)
This model includes all the features covered by the previous three models and it includes an exponent factor α used to capture the deviation of the symmetric S-shaped dynamics of the simple logistic model. It is known to be pertinent since it can capture the epidemic curve in all its phases.
2.5. Suspected-Infected-Recovered (SIR) model
In this paper, the SIR model is adopted since it has been reported to be simpler than other complicated models such as SIER [8],[35]–[37]. It will be used for comparison with the developed four logistic models. SIR model includes a set of differential equations (in continuous-time) describing the relationships between subsets of a population including Suspected (S), Infected (I) and Recovered (R) [9]–[10]. In SIR model, the dynamics of the COVID-19 are described as follows Eq 5–Eq 7:
dS(t)dt=−KS(t)I(t)
dI(t)dt=KS(t)I(t)−1βI(t)
dR(t)dt=1βI(t)
where K is the contact rate expressing the probability of being infected and β is the characteristic duration of the disease.
Since the COVID-19 pandemic data are available in a discrete-time level (daily), finding SIR model solutions should be based on efficient algorithms operating on discrete-time [9]. For this purpose, the first-order Euler method is used for transforming the SIR continuous-time model (Eq 5–Eq 7) into a discrete-time form as follows (Eq 8–Eq 10) (for this purpose, dS(t) = S(t+dt) − S(t) and dt = 1 day):
S(t+1)=S(t)−KS(t)I(t)
I(t+1)=I(t)+KS(t)I(t)−1βI(t)
R(t+1)=R(t)+1βI(t)
where the subscript t indicates the day number.
2.6. Computational methodology
PSO has been used in the literature for modeling the COVID-19 dynamics. For example, in [24], PSO has been used among other metaheuristic algorithms to tune optimally the parameters of an adaptive neuro-fuzzy inference system (ANFIS) used for predicting the number of infected cases in upcoming days in China. PSO has provided good results in terms of coefficient of determination (R2 = 0.9492) and mean absolute percentage error (MAPE = 5.12%). The PSO technique has been also used successfully in calibrating SEIR model for the case study of Hubei, China [25]. In addition to a good data fitting, the PSO algorithm helped in detecting several nonlinear aspects including chaos. The identified model parameters have been used later to establish efficient control strategies. Robust machine learning approach based on PSO has been investigated by [26] to predict the number of infected individuals in Italy. Seven-parameters SEIR model has been found to provide accuracies in the number of susceptible cases ranging from 10% of the population to 40% respectively in Lombardy and Valle d'Aosta.
In this study, the identification of the five proposed models' parameters is performed using a swarm intelligence population-based algorithm, the particle swarm optimization (PSO) [27]. PSO is an emerging optimization technique inspired from animals' social behavior such as bird flocking or fish schooling. It is known to be easy to implement and simple in its intuitive principle [28]. In the PSO principle, the swarm is composed of S particles coding each one a candidate solution for the epidemic model parameters' identification problem. During the optimization procedure, these particles fly across the D-dimensional search-space looking for the optimal position leading to the minimum value of an objective function. The objective function in this study is the nonlinear least-square error between the reported cumulative infected cases and the number of cases calculated using the D parameters of the model associated to each particle. After being initialized randomly at the beginning of the optimization process, the particle move is based on three components: following its current direction, returning back to its best personal position so far visited and moving toward the position reached by the best particle in the swarm. Let i be the index of the ith particle and j be the jth dimension of the model parameters vector. The particle position is defined as:
Xi=Xi1,Xi2……Xij……XiD I=1,2,…,S
To the ith particle are associated three vectors having the same size and the same parameters orders as in Xi. Pi, Gi and Vi denote respectively the best position so far visited, the position of the best particle among the swarm and the current velocity vector. During the optimization process (with k denotes the iteration index), each particle inside the swarm evaluates its current objective function and updates its position following the set of Eq 12–Eq 14:
Vik+1=wkVik+c1r1(Pi−Xi)+c2r2(Gi−Xi)
Xik+1=Xik+Vik+1
wk=wmax−wmax−wminkmax×k
The inertia weight coefficient is included to create a lance between global search ability at the beginning (high value of the inertia weight) and local search ability at the end of the of optimization procedure. The inertia weight decreases linearly across the iteration index. The computations are stopped when a maximum value of the iteration index will be reached. The adopted solution of the identification problem is reported as the position vector of the global best particle at the final iteration. The optimization problem is then defined as follows:
ˆX=argmin∑t=Nt=1(C(t,X)−C(t))2
Xjl≤Xj≤Xju
Xjl and Xju denote respectively the lower and upper limits of the jth dimension in X.
For the SIR model, the criterion to be minimized is given below (Eq 17):
t0 and tf are the first day and the last day of the COVID-19 collected data, respectively. ˆS(t),ˆI(t)andˆR(t) are the estimates of S, I and R at the tth day.
3.
Results
In this section, the results of using the particle swarm optimization (PSO) algorithm described in the previous section for calibrating four phenomenological epidemic models and SIR compartmental model using data of COVID-19 cumulative cases in Saudi Arabia covering the period from March 2nd to October 10th, 2020 are provided. The PSO setting parameters are presented in Table 1.
Table 1.
Parameter settings of the PSO algorithm.
Parameter
Meaning
Value
S
Number of particles
50
kmax
Maximum iteration number used as stopping criterion
The dataset used to calibrate the five previously described models is collected from the Saudi Ministry of Health website covering the period between March 2nd and October 10th, 2020. This dataset includes 223 daily observations which have been divided into two subsets: 190 observations used for training the models and the remaining 33 observations used for the models' validation. Testing dataset is composed of the five days immediately following October 10, 2020. This dataset is not included in the training and validation phases. When examining the Saudi Arabia's COVID-19 curves, it is easy to detect that an exponential growth for the cumulative infected cases occurs at the beginning of the pandemic and that around September 20, 2020, the curve starts becoming flat. The new daily reported cases' curve exhibits a clear fluctuating and variable behavior presenting four peaks (May 17, June 17, June 30 and July 6). This fluctuation is correlated with numerous measures implemented by the Saudi authorities to mitigate the virus spread while considering economic and social issues.
To compare the four models' prediction results, three statistical performance indicators, namely, the coefficient of determination (R2), the mean absolute percentage error (MAPE), the root mean squared error (RMSE) [27] and the Kolmogorov-Smirnov (K-S) test [29] are used. K-S is a two-sample test used to compare the cumulative distribution probabilities of the reported number of infected individuals and the same number forecasted by one of the investigated models. In Table 2 below, H = 0 indicates that “Do not reject the null hypothesis at the 5% significance level” and H = 1 indicates that “Reject the null hypothesis at the 5% significance level”.
In this paper, the results of the best run of each model are reported since PSO is a stochastic optimization technique which need to be run many times. Table 2 shows the optimal parameters for each model as well as the values of the performance indicators including the K-S test result.
Table 2.
Results of the best run of each prediction model.
From Table 2, it can be seen that the generalized Richards model (GRM) outperforms all the other four models in terms of all performance indicators. Moreover, this model allows predicting a probable end date (found to be around the end of 2020). The SIR model is ranked in the second place after the GRM in terms of performance metrics. Both GRM and SIR models were valid according to the Kolmogorov-Smirnov K-S test indicating that they provide forecasted and real (reported) infections derived from similar empirical distribution functions [29]. Both models provide also almost similar values of the coefficient of determination R2 (0.9953 for GRM and 0.9926 for SIR). This result can be attributed to the fact that logistic models are rigorously derived from the simple epidemiological SIR model as reported in [9],[30]. The third-ranked model is found to be the GLM. In fact, it provided a probable end date near from the one provided by the best model (the GRM) but the number of infected cases at the end of the pandemic is higher than the one provided by the GRM. The GLM model is found to be valid according to the K-S test. The fourth model is CLGM. In fact, it yielded a final capacity of the virus almost near of the one issued from the best model (GRM) but an end date around the end of August, 2020 which is not true according to the current COVID-19 statistics in Saudi Arabia. Moreover, the K-S test was non valid for the CLGM model. It should be noted that, according to the curve in Figure 4, the CLGM has a chance to meet the end of the virus capacity since the real curve is still under the curve provided by the CLGM. The last model is the GGM. This model is found to be good in fitting data for the early stage of the epidemic but it becomes inaccurate in the coming stages [31]. Conceptually, the GGM model cannot capture the probable end of the outbreak since its curve will continue growing with time. In Table 3, the actual and forecasted (by the GRM) cumulative number of infected cases are provided for the validation phase. The differences are relatively small which confirms the good fit performance of the GRM when applied to a dataset not used during the training phase.
Table 3.
Reported and predicted cumulative infected cases for the validation phase (day#1 is September 8,2020): results of the GRM.
In Table 4, the number of cumulative infected cases provided by the GRM as well as real reported numbers are compared. A relative error around 4% is found to characterize the model. Note here that the considered five days were not included in the model training and validation phases.
4.2. Analysis of the generalized Richards model (GRM) forecasts
The generalized Richards model (GRM) is found to outperform all the other four models developed in this paper. Therefore, its main features including the advantages, the limitations and the robustness will be discussed in detail. In forecasting exercises, the data available at the hand of the designer is usually divided into 80% for training the model and 20% for validating it. However, when investigating our models including the GRM, we have tried other choices for the two subsets and we found that this didn't affect a lot the overall forecasting performances more particularly for the case of the GRM. One of the main drawbacks of the GRM is that it can't predict the fatality curve like in other researches such as [38].
Possible sources of errors in final estimates of the cumulative number of infections have been included in the models' parameters through the particle swarm optimization (PSO) algorithm. In fact, during the parameters' identification process, the vector of parameters is perturbed using the r1 and r2 random numbers (see Table 1: parameter settings of the PSO and Eq 12: particle move). A stretched logistic equation for COVID-19 spreading in Italy has been proposed in [30]. This model is expected to take into account the time-dependency of the growth rate. By comparing the curve provided in Figure 6 of [30] and the curve provided by our GRM model (Figure 5 of this paper), we can conclude that our model is behaving similarly to a stretched logistic model with the difference that in Saudi Arabia, COVID-19 pandemic didn't reach the flat curve, indicating the end of the virus current wave, yet.
In order to highlight the effect of the changing situation of transmissibility, several scenarios related to intermediate date ranges have been investigated for the GRM as representative of the studied four logistic models. The GRM has been calibrated respectively using the first 100, 150, 200 and 223 days used for the model training. The results of different scenarios are summarized in Table 5.
Table 5.
Results of the GRM in different time scenarios.
As depicted in Table 5, the four runs' parameters are extremely different which reflects relatively high variability of the growth dynamics of the epidemic. The situation of transmissibility is found to be changing. This fact is also clear in Figure 1 where it is shown that the curve of new infections presented four local peaks. Obviously, the coefficient of determination R2 increases when the length of the sample used for training the GRM increases. This confirms also that using more data for training may allow detecting different dynamics of the COVID-19 pandemic and therefore being able to explain the effect of the virus containment measures [39] and the efficiency of the social behavior including isolation like in the case of Brazil [40].
• The GRM model has been trained using other parameter fitting techniques (genetic algorithm (ga) and Levenberg-Marquardt (LM)) in order to show the superiority of the particle swarm optimization technique. The PSO is found to be more efficient at least in three points: (1) The PSO has global search ability since the particles are initialized randomly over the whole search space at the beginning of the identification process; (2) The particles are then driven towards the “sub-optimal” solution. Following the inertia weight coefficient which favors the “exploration” at the beginning and favors the “exploitation” at the end of the identification process; and (3) The PSO has been found to operate in complex search-spaces that should be non-convex. However, both LM and “ga” have been trapped into local minima since they are sensitive to the initialization.
• The PSO technique used in this paper to calibrate all the developed models is known to be stochastic since it initializes the particles randomly within the search-space limits and it includes two numbers (r1 and r2) which are generated randomly inside the interval [0–1] at each iteration. The two random numbers multiply respectively the local search component and the global search component (see Eq 12). The particle move includes through the use of r1 and r2 some kind of perturbations reflecting thus the robustness of the derived solutions. Due to this random aspect, the algorithm has been run several times and the results of the best run are adopted. Thus, the PSO is found to yield “sub-optimal” solutions that are not unique. A trade-off between optimality and feasibility is ensured.
• In order to compare our approaches to other approaches applied to the case study of Saudi Arabia, three references studying the prediction of the COVID-19 number of infected cases are selected. The results of comparison are included in Table 6 below.
It can be concluded from this comparison that the results are extremely variable since the forecasting performance metrics depend on the length and quality of the available data, the virus stage and transmissibility variation, the measures taken by the local authorities and on the used method. Forecasting is then a case-sensitive exercise.
Table 6.
Comparison of the prediction of COVID-19 infected cases in Saudi Arabia.
Times-series ARIMA model and its variants for predicting the next four weeks infections
• R2: ranges from 0.46 to 0.99 • MAPE: ranges from 2.6% to 32.80%
Although the performance metrics are reported to be good, checking back the forecasted values showed the model to overestimate the number of cumulative infected case in the four weeks after the end of the dataset as well as an inaccurate end date of the outbreak
Logistic growth and SIR models for predicting more than one month of daily infected cases
• Growth model end date around the end of June 2020 with about 70,000 of total cases • SIR model end date was forecasted to be around the end of June and a cumulative number of 80,000
Both models are found to be inaccurate since the virus continues spreading as of October 18, 2020
Using a modified SIR model including parameters related to temperature, humidity, population density and the intensity of control measures taken by local governments.
Daily new cases found to range from 1,800 to 2,500 between May 15, 2020 and May 26, 2020
Forecasts where relatively accurate when compared to the reported cases
GRM (this work)
Daily from March 2, 2020 to October 10, 2020
Using the generalized Richards Model (GRM) for predicting cumulative infected cases
• R2: 0.9953 • MAPE: 3.2889% • Predicted end date: end of 2020 • Predicted cumulative number at the end of the outbreak: 378,299
In this paper, a modeling procedure based on phenomenological and compartmental models combined with the particle swarm optimization (PSO) technique is carried out using reported cases from Saudi Arabia for the period starting on March 2nd and ending on October 10, 2020. The cumulative infected cases and a probable end date of the COVID-19 pandemic are predicted using five models including the four-parameter generalized Richards model (GRM). This model has provided good fit of data and a probable projected end date around the end of 2020 with a total number of infected cases around 379 thousand. According to the study results, it has been shown that the COVID-19 outbreak is approaching its end. The Saudi experience can be considered as successful in containing the first wave until now and more efforts in enforcing social distancing should contribute to mitigate the virus.
Acknowledgments
This research has been funded by Scientific Research Deanship at University of Ha'il, Saudi Arabia through project number COVID-1936.
Conflict of interest
All authors declare no conflicts of interest in this paper.
Lalwani S, Sahni G, Mewara B, et al. (2020) Predicting optimal lockdown period with parametric approach using three-phase maturation SIRD model for COVID-19 pandemic. Chaos, Solitons Fractals 138: 109939. doi: 10.1016/j.chaos.2020.109939
[3]
Alboaneen D, Pranggono B, Alshammari D, et al. (2020) Predicting the Epidemiological Outbreak of the Coronavirus Disease 2019 (COVID-19) in Saudi Arabia. Int J Environ Res Public Health 17: 4568. doi: 10.3390/ijerph17124568
[4]
Almeshal A, Almazrouee A, Alenezi M, et al. (2020) Forecasting the Spread of COVID-19 in Kuwait Using Compartmental and Logistic Regression Models. Appl Sci 10: 3402. doi: 10.3390/app10103402
[5]
Arora P, Kumar H, Panigrahi P (2020) Prediction and analysis of COVID-19 positive cases using deep learning models: A descriptive case study of India. Chaos, Solitons and Fractals 139: 110017. doi: 10.1016/j.chaos.2020.110017
[6]
Rodriguez O, Conde-Gutierrez R, Hernadez-Gavier A (2020) Modeling and prediction of COVID-19 in Mexico applying mathematical and computational models. Chaos, Solitons Fractals 138: 109946. doi: 10.1016/j.chaos.2020.109946
[7]
Hu Z, Cui Q, Han J, et al. (2020) Evaluation and prediction of the COVID-19 variations at different input population and quarantine strategies, a case study in Guangdong province, China. Int J Infect Dis 95: 231-240. doi: 10.1016/j.ijid.2020.04.010
[8]
Roda W, Varughese M, Han D, et al. (2020) Why is it difficult to accurately predict the COVID-19 epidemic? Infect Dis Modell 5: 271-281. doi: 10.1016/j.idm.2020.03.001
[9]
Postnikov E (2020) Estimation of COVID-19 dynamics “on a back-of-envelope”: Does the simplest SIR model provide quantitative parameters and predictions? Chaos, Solitons Fractals 135: 109841. doi: 10.1016/j.chaos.2020.109841
[10]
Ndaïrou F, Area I, Nieto J, et al. (2020) Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan. Chaos, Solitons Fractals 135: 109846. doi: 10.1016/j.chaos.2020.109846
[11]
Alzahrani S, Aljamaan I, Al-fakih E (2020) Forecasting the spread of the COVID-19 pandemic in Saudi Arabia using ARIMA prediction model under current public health interventions. J Infect Public Health 13: 914-919. doi: 10.1016/j.jiph.2020.06.001
[12]
Aslam M (2020) Using the Kalman filter with Arima for the COVID-19 pandemic dataset of Pakistan. Data Brief 31: 105854. doi: 10.1016/j.dib.2020.105854
Chen D, Chen X, Chen J (2020) Reconstructing and forecasting the COVID-19 epidemic in the United States using a 5-parameter logistic growth model. Global Health Res Policy 5: 25. doi: 10.1186/s41256-020-00152-5
[16]
Malavika B, Marimuth, Joy M, et al. (2020) Forecasting COVID-19 epidemic in India and high incidence states using SIR and logistic growth models. Clin Epidemiol Global Health .
[17]
Munayco C, Tariq A, Rothenberg R, et al. (2020) Early transmission dynamics of COVID-19 in a southern hemisphere setting: Lima-Peru: February 29 the March 30th, 2020. Infect Dise Modell 5: 338-345. doi: 10.1016/j.idm.2020.05.001
[18]
Wang P, Zheng X, Jiayang L, et al. (2020) Prediction of epidemic trends in COVID-19 with logistic model and machine learning technics. Chaos, Solitons Fractals 139: 110058. doi: 10.1016/j.chaos.2020.110058
[19]
Behnood A, Gholafshan A, Hosseini S (2020) Determinants of the infection rate of the COVID-19 in the U.S. using ANFIS and virus optimization algorithm (VOA). Chaos, Solitons Fractals 139: 110051. doi: 10.1016/j.chaos.2020.110051
[20]
Singhal A, Singh P, Lall B, et al. (2020) Modeling and prediction of COVID-19 pandemic using Gaussian mixture model. Chaos, Solitons Fractals 138: 110023. doi: 10.1016/j.chaos.2020.110023
[21]
Wu K, Darcet D, Wang Q, et al. Generalized logistic growth modeling of the COVID-19 outbreak in 29 provinces in China and in the rest of the world (2020) .Available from: https://arxiv.org/abs/2003.05681.
[22]
Shen C (2020) Logistic growth modelling of COVID-19 proliferation in China and its international implications. Int J Infect Dis 96: 582-589. doi: 10.1016/j.ijid.2020.04.085
[23]
Materassi M (2019) Some fractal thoughts about the COVID-19 infection outbreak. Chaos, Solitons Fractals X4: 100032. doi: 10.1016/j.csfx.2020.100032
[24]
Al-qaness M, Ewees A, Fan H, et al. (2020) Optimization Method for Forecasting Confirmed Cases of COVID-19 in China. J Clin Med 9: 674. doi: 10.3390/jcm9030674
[25]
He S, Peng Y, Sun K (2020) SEIR modeling of the COVID-19 and its dynamics. Nonlinear Dyn 101: 1667-1680. doi: 10.1007/s11071-020-05743-y
[26]
Paggi M (2020) An Analysis of the Italian Lockdown in Retrospective Using Particle Swarm Optimization in Machine Learning Applied to an Epidemiological Model. Physics 2: 368-382. doi: 10.3390/physics2030020
[27]
Boubaker S (2017) Identification of nonlinear Hammerstein system using mixed integer-real coded particle swarm optimization: application to the electric daily peak-load forecasting. Nonlinear Dyn 90: 797-814. doi: 10.1007/s11071-017-3693-9
Alberti T, Faranda D (2020) On the uncertainty of real-time predictions of epidemic growths: A COVID-19 case study for China and Italy. Commun Nonlinear Sci Numer Simulat 90: 105372. doi: 10.1016/j.cnsns.2020.105372
[30]
Consolini G, Materassi M (2020) A stretched logistic equation for pandemic spreading. Chaos, Solitons Fractals 140: 110113. doi: 10.1016/j.chaos.2020.110113
[31]
Roosa K, Lee Y, Luo R, et al. (2020) Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infect Dis Modell 5: 256-263. doi: 10.1016/j.idm.2020.02.002
Pelinovsky E, Kurkin A, Kurkina O, et al. (2020) Logistic equation and COVID-19. Chaos, Solitons Fractals 140: 110241. doi: 10.1016/j.chaos.2020.110241
Faranda D, Alberti T Modelling the second wave of COVID-19 infections in France and Italy via a Stochastic SEIR model (2020) .Available from: https://arxiv.org/pdf/2006.05081v1.pdf.
[36]
Mwalili S, Kimathi M, Ojiambo V, et al. (2020) SEIR model for COVID-19 dynamics incorporating the environment and social Distancing. BMC Res Notes 13: 352. doi: 10.1186/s13104-020-05192-1
[37]
Castorina P, Iorio A (2020) Data analysis on Coronavirus spreading by macroscopic growth laws. Int J Mod Phys C 7: 2050103. doi: 10.1142/S012918312050103X
[38]
Vasconcelos G, Macêdo A, Ospina R, et al. (2020) Modelling fatality curves of COVID-19 and the effectiveness of intervention strategies. Peer J 8: e9421. doi: 10.7717/peerj.9421
[39]
Maier B, Brockmann D (2020) Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. Science 368: 742-746. doi: 10.1126/science.abb4557
[40]
Crokidakis N (2020) COVID-19 spreading in Rio de Janeiro, Brazil: Do the policies of social isolation really work? Chaos, Solitons Fractals 136: 109930. doi: 10.1016/j.chaos.2020.109930
This article has been cited by:
1.
Jacqueline Duhon, Nicola Bragazzi, Jude Dzevela Kong,
The impact of non-pharmaceutical interventions, demographic, social, and climatic factors on the initial growth rate of COVID-19: A cross-country study,
2021,
760,
00489697,
144325,
10.1016/j.scitotenv.2020.144325
Rafat Zrieq, Souad Kamel, Sahbi Boubaker, Fahad D. Algahtani, Mohamed Ali Alzain, Fares Alshammari, Fahad Saud Alshammari, Badr Khalaf Aldhmadi, Suleman Atique, Mohammad A. A. Al-Najjar, Sandro C. Villareal,
Time-Series Analysis and Healthcare Implications of COVID-19 Pandemic in Saudi Arabia,
2022,
10,
2227-9032,
1874,
10.3390/healthcare10101874
4.
Adam P. Piotrowski, Agnieszka E. Piotrowska,
Differential evolution and particle swarm optimization against COVID-19,
2022,
55,
0269-2821,
2149,
10.1007/s10462-021-10052-w
5.
Elizabeth Jordan, Delia E. Shin, Surbhi Leekha, Shapour Azarm,
Optimization in the Context of COVID-19 Prediction and Control: A Literature Review,
2021,
9,
2169-3536,
130072,
10.1109/ACCESS.2021.3113812
6.
Abdennour Sebbagh, Sihem Kechida,
EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics,
2022,
12,
2045-2322,
10.1038/s41598-022-16496-6
7.
Rafat Zrieq, Souad Kamel, Sahbi Boubaker, Fahad Algahtani, Mohamed Alzain, Fares Alshammari, Badr Aldhmadi, Fahad Alshammari, Marcos J. Araúzo-Bravo,
Predictability of COVID-19 Infections Based on Deep Learning and Historical Data,
2022,
12,
2076-3417,
8029,
10.3390/app12168029
8.
Bernardo F. Quiroga, Cristián Vásquez, M. Ignacia Vicuña,
Nonlinear time‐series forecasts for decision support: short‐term demand for ICU beds in Santiago, Chile, during the 2021 COVID‐19 pandemic,
2022,
0969-6016,
10.1111/itor.13222
9.
Bernardo F. Quiroga, Cristián Alberto Vásquez, María Ignacia Vicuña,
Nonlinear Time Series Forecasts for Decision Support: Short-Term Demand for ICU Beds in Santiago, Chile During the 2021 COVID-19 Pandemic,
2021,
1556-5068,
10.2139/ssrn.3922370
Mohamed Haouari, Mariem Mhiri,
A particle swarm optimization approach for predicting the number of COVID-19 deaths,
2021,
11,
2045-2322,
10.1038/s41598-021-96057-5
12.
Abdulwahab Ali Almazroi, Raja Sher Afgun Usmani,
COVID-19 Cases Prediction in Saudi Arabia Using Tree-based Ensemble Models,
2022,
32,
1079-8587,
389,
10.32604/iasc.2022.020588
13.
Hadi Barzegar, Alireza Eshghi, Abtin Ijadi Maghsoodi, Amir Mosavi,
Optimal Control for Economic Development During the Pandemic,
2024,
12,
2169-3536,
2445,
10.1109/ACCESS.2023.3337825
14.
Luiz Alberto Junior Letti, Andressa Tedesco Andretta, Walter José Martinez Burgos, Fernando Enrique Rosas Vega, Carlos Ricardo Soccol,
2024,
Chapter 7,
978-3-031-55967-9,
131,
10.1007/978-3-031-55968-6_7
15.
Neriman Kartal,
Dynamics of a Conformable Fractional Order Generalized Richards Growth Model on Star Network with N=20 Nodes,
2024,
45,
2587-2680,
117,
10.17776/csj.1385759
16.
Raqqasyi Rahmatullah Musafir, Syaiful Anam,
PARAMETER ESTIMATION OF COVID-19 COMPARTMENT MODEL IN INDONESIA USING PARTICLE SWARM OPTIMIZATION,
2022,
10,
2541-092X,
283,
10.20473/jbe.V10I32022.283-292
17.
Wolfgang Rauch, Hannes Schenk, Nikolaus Rauch, Matthias Harders, Herbert Oberacher, Heribert Insam, Rudolf Markt, Norbert Kreuzinger,
Estimating actual SARS-CoV-2 infections from secondary data,
2024,
14,
2045-2322,
10.1038/s41598-024-57238-0
18.
Carlito Pinto, Koichi Shimakawa, Hidekazu Fukai,
Analyzing the COVID-19 transmission dynamics with the stretched logistic model: Origin of anomalous time-dependent infection rates,
2025,
15,
2158-3226,
10.1063/5.0248156
Rafat Zreiq, Souad Kamel, Sahbi Boubaker, Asma A Al-Shammary, Fahad D Algahtani, Fares Alshammari. Generalized Richards model for predicting COVID-19 dynamics in Saudi Arabia based on particle swarm optimization Algorithm[J]. AIMS Public Health, 2020, 7(4): 828-843. doi: 10.3934/publichealth.2020064
Rafat Zreiq, Souad Kamel, Sahbi Boubaker, Asma A Al-Shammary, Fahad D Algahtani, Fares Alshammari. Generalized Richards model for predicting COVID-19 dynamics in Saudi Arabia based on particle swarm optimization Algorithm[J]. AIMS Public Health, 2020, 7(4): 828-843. doi: 10.3934/publichealth.2020064
Times-series ARIMA model and its variants for predicting the next four weeks infections
• R2: ranges from 0.46 to 0.99 • MAPE: ranges from 2.6% to 32.80%
Although the performance metrics are reported to be good, checking back the forecasted values showed the model to overestimate the number of cumulative infected case in the four weeks after the end of the dataset as well as an inaccurate end date of the outbreak
Logistic growth and SIR models for predicting more than one month of daily infected cases
• Growth model end date around the end of June 2020 with about 70,000 of total cases • SIR model end date was forecasted to be around the end of June and a cumulative number of 80,000
Both models are found to be inaccurate since the virus continues spreading as of October 18, 2020
Using a modified SIR model including parameters related to temperature, humidity, population density and the intensity of control measures taken by local governments.
Daily new cases found to range from 1,800 to 2,500 between May 15, 2020 and May 26, 2020
Forecasts where relatively accurate when compared to the reported cases
GRM (this work)
Daily from March 2, 2020 to October 10, 2020
Using the generalized Richards Model (GRM) for predicting cumulative infected cases
• R2: 0.9953 • MAPE: 3.2889% • Predicted end date: end of 2020 • Predicted cumulative number at the end of the outbreak: 378,299
Times-series ARIMA model and its variants for predicting the next four weeks infections
• R2: ranges from 0.46 to 0.99 • MAPE: ranges from 2.6% to 32.80%
Although the performance metrics are reported to be good, checking back the forecasted values showed the model to overestimate the number of cumulative infected case in the four weeks after the end of the dataset as well as an inaccurate end date of the outbreak
Logistic growth and SIR models for predicting more than one month of daily infected cases
• Growth model end date around the end of June 2020 with about 70,000 of total cases • SIR model end date was forecasted to be around the end of June and a cumulative number of 80,000
Both models are found to be inaccurate since the virus continues spreading as of October 18, 2020
Using a modified SIR model including parameters related to temperature, humidity, population density and the intensity of control measures taken by local governments.
Daily new cases found to range from 1,800 to 2,500 between May 15, 2020 and May 26, 2020
Forecasts where relatively accurate when compared to the reported cases
GRM (this work)
Daily from March 2, 2020 to October 10, 2020
Using the generalized Richards Model (GRM) for predicting cumulative infected cases
• R2: 0.9953 • MAPE: 3.2889% • Predicted end date: end of 2020 • Predicted cumulative number at the end of the outbreak: 378,299
Figure 1. New daily and cumulative infected cases for Saudi Arabia between March 2 and October 10, 2020