More than ten million deaths make influenza virus one of the deadliest of history. About half a million sever illnesses are annually reported consequent of influenza. Influenza is a parasite which needs the host cellular machinery to replicate its genome. To reach the host, viral proteins need to interact with the host proteins. Therefore, identification of host-virus protein interaction network (HVIN) is one of the crucial steps in treating viral diseases. Being expensive, time-consuming and laborious of HVIN experimental identification, force the researches to use computational methods instead of experimental ones to obtain a better understanding of HVIN. In this study, several features are extracted from physicochemical properties of amino acids, combined with different centralities of human protein-protein interaction network (HPPIN) to predict protein-protein interactions between human proteins and Alphainfluenzavirus proteins (HI-PPIs). Ensemble learning methods were used to predict such PPIs. Our model reached 0.93 accuracy, 0.91 sensitivity and 0.95 specificity. Moreover, a database including 694522 new PPIs was constructed by prediction results of the model. Further analysis showed that HPPIN centralities, gene ontology semantic similarity and conjoint triad of virus proteins are the most important features to predict HI-PPIs.
Citation: Babak Khorsand, Abdorreza Savadi, Javad Zahiri, Mahmoud Naghibzadeh. Alpha influenza virus infiltration prediction using virus-human protein-protein interaction network[J]. Mathematical Biosciences and Engineering, 2020, 17(4): 3109-3129. doi: 10.3934/mbe.2020176
[1] | Humaid Abdullah ALHinai, Azrul Mohd Ariffin, Miszaina Osman . Revolutionizing Oman's energy network with an optimal mixture renewable energy source. AIMS Energy, 2023, 11(4): 628-662. doi: 10.3934/energy.2023032 |
[2] | Halima Djeldjli, Djelloul Benatiallah, Camel Tanougast, Ali Benatiallah . Solar radiation forecasting based on ANN, SVM and a novel hybrid FFA-ANN model: A case study of six cities south of Algeria. AIMS Energy, 2024, 12(1): 62-83. doi: 10.3934/energy.2024004 |
[3] | Marwa M. Ibrahim, Amr A. Elfeky, Amal El Berry . Forecasting energy production of a PV system connected by using NARX neural network model. AIMS Energy, 2024, 12(5): 968-983. doi: 10.3934/energy.2024045 |
[4] | Chris Thankan, August Winters, Jin Ho Jo, Matt Aldeman . Feasibility of applying Illinois Solar for All (ILSFA) to the Bloomington Normal Water Reclamation District. AIMS Energy, 2021, 9(1): 117-137. doi: 10.3934/energy.2021007 |
[5] | Djelloul Benatiallah, Ali Benatiallah, Kada Bouchouicha, Bahous Nasri . Estimation of clear sky global solar radiation in Algeria. AIMS Energy, 2019, 7(6): 710-727. doi: 10.3934/energy.2019.6.710 |
[6] | Ming-Tang Tsai, Chih-Jung Huang . Integration of the radial basis functional network and sliding mode control for the sunshine radiation forecast. AIMS Energy, 2024, 12(1): 31-44. doi: 10.3934/energy.2024002 |
[7] | Daniel Chuquin-Vasco, Cristina Calderón-Tapia, Nelson Chuquin-Vasco, María Núñez-Moreno, Diana Aguirre-Ruiz, Vanesa G. Lo-Iacono-Ferreira . Mathematical modeling of a binary ORC operated with solar collectors. Case study—Ecuador. AIMS Energy, 2023, 11(6): 1153-1178. doi: 10.3934/energy.2023053 |
[8] | Muhammad Farhan Hanif, Muhammad Sabir Naveed, Mohamed Metwaly, Jicang Si, Xiangtao Liu, Jianchun Mi . Advancing solar energy forecasting with modified ANN and light GBM learning algorithms. AIMS Energy, 2024, 12(2): 350-386. doi: 10.3934/energy.2024017 |
[9] | Abdulrahman Almutlaq . Indoor experiments of a horizontal multiple effects diffusion solar still: Influence of heat input and the number of stages. AIMS Energy, 2024, 12(2): 532-547. doi: 10.3934/energy.2024025 |
[10] | Abdulrahman Th. Mohammad, Wisam A. M. Al-Shohani . Numerical and experimental investigation for analyzing the temperature influence on the performance of photovoltaic module. AIMS Energy, 2022, 10(5): 1026-1045. doi: 10.3934/energy.2022047 |
More than ten million deaths make influenza virus one of the deadliest of history. About half a million sever illnesses are annually reported consequent of influenza. Influenza is a parasite which needs the host cellular machinery to replicate its genome. To reach the host, viral proteins need to interact with the host proteins. Therefore, identification of host-virus protein interaction network (HVIN) is one of the crucial steps in treating viral diseases. Being expensive, time-consuming and laborious of HVIN experimental identification, force the researches to use computational methods instead of experimental ones to obtain a better understanding of HVIN. In this study, several features are extracted from physicochemical properties of amino acids, combined with different centralities of human protein-protein interaction network (HPPIN) to predict protein-protein interactions between human proteins and Alphainfluenzavirus proteins (HI-PPIs). Ensemble learning methods were used to predict such PPIs. Our model reached 0.93 accuracy, 0.91 sensitivity and 0.95 specificity. Moreover, a database including 694522 new PPIs was constructed by prediction results of the model. Further analysis showed that HPPIN centralities, gene ontology semantic similarity and conjoint triad of virus proteins are the most important features to predict HI-PPIs.
With the increased concern and interest in energy preservation and environmental protection, the world today is moving into a new era; transition from almost total dependence of the fossil fuel to an increased use of alternative sources of energy. Solar radiation is one of the promising and potential renewable energy sources especially in regions like UAE.
An accurate and detailed long-term knowledge of the available global solar irradiance on horizontal surfaces is of a major importance for the design and development of solar energy systems in a given region. Information about solar radiation can be obtained by installing expensive measuring sensors (pyranometers) at as many locations as possible in this region thus, requiring daily maintenance and data acquisition; consequently, increasing the cost of collecting solar radiation data. In most of the cases, the potential sites for solar energy implementation are not covered by measuring stations, especially in the deserted regions. Many countries do not have sufficient network of weather stations for collecting solar data. For such regions, empirical models have to be developed using meteorological data from available measurement stations. These models are then used to estimate solar irradiance values at other locations in the region where solar energy systems are planned [2].
UAE is among countries having potential for solar energy where the solar irradiance has significant strength, the average annual solar hours is approximately 3568 h (i.e. 9.7 h/day), which corresponds to an average annual global solar irradiance of approximately 2285 kWh/m2 (i.e. 6.3 kWh/m2 per day) [2].
Numerous researchers have developed statistical and empirical regression models to predict the monthly average daily global solar irradiance in their regions using various weather parameters [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]. The mean daily sunshine duration and air delta temperature were the most available and commonly used parameters. The most popular model developed by researchers was the linear model by Angström-Prescott. This model establishes a linear relationship between global solar irradiance and sunshine duration taking into account extra-terrestrial solar irradiance and the theoretical maximum daily sunshine hours. Many studies with empirical regression and machine learning models were presented in the literature for many regions around the world. Recently, different models predicting global solar irradiance using various meteorological and climatological variables have been published [16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37].
Assi et al. [25,26] used four meteorological 12-years data between 1995 and 2007 to train and validate a Feed Forward ANN-based estimation system of solar radiation in Al-Ain city in UAE. The authors examined several MLP architectures and tested more than twelve alternatives based on various derivatives of back-propagation training algorithms.
Antonanzas et al. [27] presented a new methodology to build parametric models for the estimation of global solar irradiation. The models were adjusted to specific on-site characteristics based on an evaluation of the variables importance. Authors have adjusted general parametric models such as the Bristow and Campbell BC models [27] with the on-site particularities. The presented methodology was appropriate for the investigated case study. The daily range of maximum and minimum temperatures, the logical variable of rainfall, and the daily mean wind speed were among the parameters that showed higher correlation with solar irradiance and that were included in the newly developed models
Ahmed and Adam [28] applied a feed forward back-propagation neural network on weather data measured at Qena-Egypt during the year 2007. The proposed approach used location coordinates and sunshine hours to estimate monthly average daily global solar radiation. The authors presented a comparative study between the described MLP-based approach and other empirical models. Based on their experimental results, authors showed the advantages of the MLP-based estimation technique for solar radiation estimation over the existing empirical regression models. Khatib et al. [29] developed a feed forward multi-layer perceptron with four inputs: longitude, latitude, day of the month, and sunshine radiation to predict the clearance index. The clearance index helped in calculating the solar irradiation. The models used long term solar radiation data for 28 sites in Malaysia measured between years 1984 and 2004.
Ramedani et al. [30] investigated two models based on Support Vector Regression technique (SVR) which is a type of Support Vector Machines (SVM) for predicting Global solar Radiation GSR in Tehran province. The authors examined two kernel functions for SVR: a radial basis function and a polynomial function. The authors designed and validated their approach on a measured daily data consisting on Temperature and sunshine parameters and belonging to seven-year period. The proposed approach, mainly the one based on radial basis function (SVR-rbf) showed better performance when compared to an ANN-based and a Neuro-Fuzzy based systems. Olatomiwa et al. [31] proposed a hybrid approach for predicting solar radiation based on SVMs coupled with the meta-heuristic Firefly algorithm FFA. The FFA has been applied to detect the optimal parameters of the SVM algorithm. The performance of the proposed approach showed superiority comparing to others based on ANN and GP when they have been tested on temperature and sunshine hour's data records collected from three different regions in Nigeria. Mohamadani et al. [32] presented a comparative evaluation among three soft computing methodologies for estimating global solar radiation in a specific region in Iran based on temperature measures. The developed models are an Adaptive Neuro-Fuzzy inference system ANFIS, a radial basis function SVR (SVR-rbf), and a polynomial basis function SVR (SVR-poly). The statistical analysis showed a superiority for the SVR-rbf over the two remaining examined models when validated on daily temperature measures. Kizi [33] proposed a Fuzzy-Genetic FG approach to model and predict solar radiation. The heuristic genetic algorithm has been used to find the optimum parameters of the Fuzzy inference method. The author used latitude, longitude, and altitude as inputs to the FG model to estimate one month ahead solar radiation in some regions in Turkey.
Recently, many researchers tried to instigate the most significant meteorological variables and parameters for estimating and predicting GSR [34,35]. Mohamadi et al. [34] examined the influence of meteorological parameters on horizontal GSR. They examined nine climatological parameters collected from three different cities in Iran. The authors applied an adaptive neuro-fuzzy inference technique in their selection procedure, and they determined the most influential parameters' combinations for each city and have concluded that it is not possible to introduce an optimal combination of inputs for all cities. They justified their conclusion by the fact that GSR, which is special for each region, depends on climate conditions and geographical location that are special for each region.
Demirhan and Atilgan [35] presented a robust coplot optimization approach coupled with a GP technique for solar radiation estimation. The robust coplot analysis technique has been applied on the measures of solar radiation and other related parameters to identify the optimal set of covariates in a data that consists of solar radiation, meteorological, and terrestrial variables. The main goal was to handle the multicollinearity problem that may exist among variables, and to eliminate the effect of outliers on the space of solar radiation modeling. The optimal set of covariates have been then used in a GP technique to construct monthly and yearly solar radiation estimation models. Pan Ⅰ et al. [36] presented a GP-based approach for predicting solar radiation using six geographical and sunshine duration data from India. The authors introduced what they called Multi-Gene Genetic Programming (MGGP) models where each individual solution, named a gene, is composed by a weighted combination of sub-individuals named Single-Gene Genetic Programming (SGGP) models. The authors indicated that the MGGP based approach has outperformed the other ones based on simple individual SGGP models as well as other classical regression models.
The current article investigates the prediction of global solar irradiance on horizontal using evolutional computational technique, namely the genetic programming (GP). Recently, the GP techniques showed good performance and flexibility in modelling non-linear regression problems [38,39]. Practically, The GP demonstrates its advantages in dynamically building complex formulas (solutions), and its flexibility to choose a set of functions and operators that match the problem to be solved. Such flexibility is possible due to the fact that the structure of the binary trees that represent solution candidates can be dynamically changed during the evolutionary process. These characteristics give the GP the ability to skip out of the local minima problem commonly found in the neural networks models especially in their feed-forward structures with back-propagation training algorithms. In the solar radiation estimation literature, the Genetic Algorithm GA has been used to select the optimal parameters of machine learning based models [33], whereas the GP algorithm has been used as core models of estimation [35,36].
In this article, the design and validation of a new GP based approach to estimate global solar irradiance using meteorological data will be described. The main idea is to find the best model for the relation between a set of meteorological parameters and the solar irradiance on a specific geographical area. Two approaches have been validated: A reference approach that consists of one global model that estimates the solar irradiance with respect to four climatological parameters, and a second approach that consists of a set of several models in a parallel structure. Each model consists of a nonlinear function that is dedicated to estimate the global solar irradiance in a specific seasonal or bi-seasonal period of the year. The experimental results indicated the advantages of using such type of multi-model structure when dealing with a set of data with large variability during the year. The remaining of this article is organized as follows: In section 2, the genetic programming is explained as an optimization heuristic technique, the GPLAB toolbox of MATLAB® that has been used in the adopted approach is then introduced. In section 3, the GP based reference approach that consists of one estimation function is described. Then, an enhanced approach also based on GP is given. Moreover, the dataset used for the design and the validation of the proposed approaches is introduced and described in this section. Discussion of the results is presented in section 4. Finally, section 5 includes conclusions and future perspectives.
The GP is an extension of the conventional genetic algorithm [40,41]. Genetic Algorithm (GA) is a metaheuristic method usually used to find an optimal solution in optimization problems based on a natural selection process. The GA starts by an initial population of random individuals. Each individual is represented by what is called chromosome that is an array of genes. Each gene represents a parameter to be optimized. Every individual (chromosome) represents a possible solution of the optimization problem and has its own fitness measure. The fitness is a measure for each individual that indicates how the solution related to this individual is suitable to solve the problem.
The GA uses the so called genetic operators: crossover, mutation, and cloning to evolve from a population to another until it reaches a population that consists of an optimal solution based on a chosen fitness objective function [40,41]. It starts by an initial random population of individuals. Then, in each generation, the GA performs the following steps:
-Select from the present population the individuals that have the best computed fitness.
-Use the best individuals to generate the next population by the crossover of those individuals. This is similar to the biological reproduction based on natural selection. A new individual has part of its genes coming from the first parent and the other part from the second parent.
-Based on a computed probability, a mutation operator may be applied to one or many chromosomes by changing the value of one of its genes. Similarly, and based on another computed probability, a chromosome with good fitness may be cloned and promoted to evolve to the next generation.
This procedure is repeated until an optimum individual is found. Figure 1 illustrates the operations of crossover and mutation of two individuals to produce new individuals for the next generation.
The GP aims to find the best computer program (function) that is composed of both data and operators and that solves a specific problem [41,42,43]. A chromosome in GP is represented by a binary tree data structure where internal nodes represent algebraic and/or logical operators whereas the external ones represent numbers and parameters related to the problem to be solved. Figure 2 shows examples of binary trees that represent mathematical expressions/functions. A function can be considered as a computer program that consists of a set of data (terminals) and actions (operators).
The process of evolution in GP starts by an initial random population of chromosomes that represent possible solutions (functions) and tries to generate new populations subsequently [42].
The evolution is controlled by a fitness function that is equivalent to the objective function adopted in local heuristic search techniques. The fitness function is special for each optimization problem and allows the evaluation of fitness of each chromosome (solution) in a population. Figure 3 illustrates the effect of the crossover operator on two selected individuals.
GPLAB is a Genetic Programming toolbox for MATLAB® [44]. GPLAB provides most of the features and operators commonly used in GP. Its modular structure allows considering it as an extendable tool that is suitable for prototyping new techniques of heuristic local search in GP. GPLAB enables a set of facilities and features to handle and control the structure and the size of both: the chromosomes that represent individual solutions and the populations that represent sets of those individuals. In addition, GPLAB allows the dynamic control of the variable size of populations during run time. This feature is indeed important in case of limited computational resources [45,46,47]. Moreover, GPLAB implements a technique for automatically adjusting the probabilities of adopted genetic operators during runtime. This feature allows the use of the GPLAB toolbox as a test workbench for new genetic operators
In this work, a GP based approach to estimate solar irradiance is designed, implemented, and validated. In the first phase of this work, a reference system is proposed, which consists of a single function that can model the relation between solar irradiance and a set of climatological factors in a specific geographical area. The reference system showed promising performance. In the second phase, the performance of the reference system is analyzed and an enhanced one that consists of multiple independent models is suggested. Each model is dedicated to estimate the solar radiation amount in a specific seasonal period of the year. All components of the two proposed approaches were designed, implemented, and validated using the GPLAB toolbox.
In the design and validation phases, a meteorological dataset provided by the National Center of meteorology and Seismology (NCMS) in Abu-Dhabi—UAE is used. The dataset consists of daily data records for the period between 2004 and 2007. Each daily record includes the measures of: air temperature, wind speed, relative humidity, and sunshine duration. The dataset has been divided into two subsets; a design subset having records for the years between 2004 and 2006 inclusive and a test subset that includes records for the year 2007. Table 1 shows some samples of the dataset. In this table, the first four columns represent the four meteorological records.
Max Temperature | Mean wind Speed (knot) | Sun Hours | Mean Relative Humidity % | Total Radiation (kWh/sq.m) |
25.7 | 5.6 | 8.9 | 68 | 462.8 |
24.5 | 7.1 | 9.6 | 67 | 459.3 |
23.0 | 5.4 | 10.2 | 54 | 515.5 |
25.1 | 6.4 | 10.1 | 47 | 505.0 |
23.6 | 8.3 | 1.6 | 63 | 230.6 |
The problem of developing an appropriate function that models the relation previously discussed, looks like a search problem for an optimal state that represents the expected function. In our case, the targeted function includes operand and operators. The operands consist of the climatological parameters and other constants that may appear in the resulting function. Besides, the set of operators may include arithmetic operators, exponential operators, and any other algebraic or non-algebraic ones like: square root, natural log, exponentials, etc.
The functions investigated in this work can be represented by binary trees data structures, where the internal nodes represent the operators whereas the terminals represent the operands. Figure 4 illustrates an example of a binary tree that represents a solution candidate. In Figure 4, X1, X2, X3, and X4 represent temperature, wind speed, sun hours, and humidity respectively.
Equation (1) shows the algebraic function that can be obtained using the binary tree of Figure 4. In this equation, sr, tmp, sh, ws, and hum stand for solar radiation, temperature, sun hours, wind speed, and humidity respectively. The internal nodes in the tree include the four arithmetic operators: addition (+), difference (-), and division (/), and multiplication (*), plus it includes the decimal log, as well as the square root (√). Figure 4 shows that the depth of the tree is equal to six. Actually, the maximum depth allowed for trees in each population is one of the adjustable parameters in a GP evolution process.
sr=√tmp∗hum∗log(sh)+(tmp−(ws−sh))−((hum−ws)−(ws−tmp)log(ws)+sh) | (1) |
In this work, fixed values for the following parameters have been adopted:
-The population size is set to be equal to 500 individuals. In our experiments the performance of optimization has been slightly affected by variations in the size of population.
-The fitness function that evaluates the efficiency of candidate solutions (chromosomes). A fitness function related to the root mean square error (RMSE) has been used.
-The depth of each of the binary trees that represents chromosomes (solution candidate) is set to be dynamic. The maximum value of that depth is chosen to be equal to six. GPLAB provides a technique that permits to start by an initial depth of trees that can be dynamically increased until a selected maximum value.
On the other hand, alternatives have been investigated:
-The probability of applying each of the genetic operators (crossover and mutation).
-The sampling method to select individuals from the current population to participate in generating new individuals for the next generation.
The first three columns in Table 2 show combinations of parameters adopted in designing the proposed approach. The implemented fitness function computes, for each individual, the RMSE between the set of exact output values available in the design dataset and the output values returned by that individual. Equation (2) describes the RMSE computation.
RMSE=√∑Ni=1(Hpi−Hi)2N | (2) |
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 4.402 |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Tournament | {arithmetic, √, log10} | 4.491 |
Dynamic | Roulette | {arithmetic, √, log10} | 4.503 |
Dynamic | Tournament | {arithmetic, √, log10} | 4.174 |
Where Hpi represents the estimated value of global solar irradiance, Hi is the measured value that is available in the design dataset, and N is the total number of records in that dataset.
As for the probabilities of applying each of the genetic operators, the GPLAB allows either to fix the values of those probabilities or to dynamically compute them at each iteration during the run time. The computation in this case is based on the history of each operator in producing individuals with the best fitness and on statistics about the newly produced individuals [44]. The results presented in the right most column of Table 1 indicate that the best fitness value (in this case the lowest) is the one related to dynamic probabilities of operators and tournament sampling method. Figure 5 shows the binary tree associated to the best individual in the last population of the best combination. Equation (3) represents the function stored in that binary tree.
sr=tmp+sh+sh∗log(sh)−log|tmp−hum|+sh−(hum−wstmp+ws) | (3) |
The results obtained using the reference system show remarkable difference between the measured values and the estimated ones. Figure 7 compares the measured and the estimated monthly average daily global solar irradiance values.
The records of the design dataset have been investigated, and the values of each meteorological factor have been analyzed. Analysis showed that the values of some factors, especially the humidity, have wide variations, i.e. a large deviation around the average value over a year. Such variations make the search for an optimal model quite difficult.
One of the suggestions to improve the whole performance is by estimating the global average solar irradiance over relatively short period of time in a year by using a multi-model approach. The main idea is to find the function with best fitness for estimating the global solar irradiance for each seasonal period of the year. Applying this strategy lead to build proficient functions. A function has been built for each two consecutive months of the year. Thus, the multi-mode system consists of six nonlinear functions. Figure 6 illustrates the structure of the proposed approach.
The evolutional computation process is launched with the same combination of parameters described in the 1st and 4th rows of Table 2. The estimation performance is significantly improved. Table 3 shows the obtained enhancement in terms of fitness of the best individual when the multi-model strategy is applied. The best performance is obtained with dynamic probabilities of genetic operators, tournament sampling, and set of functions that contains arithmetic, algebraic and logarithmic operators.
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 2.137 |
Dynamic | Tournament | {arithmetic, √, log10} | 1.218 |
Error statistical analysis showed good improvement as will be described in the next section.
The estimation performance of the suggested approach was assessed through a statistical analysis of error. The analysis was conducted by computing the RMSE and Mean Bias Error MBE that measure the variation of estimated values against the measured available ones. Low RMSE and MBE values are desired and indicate an accurate estimation. The RMSE computation is described earlier in equation (2), whereas the MBE computation is described in equation (3).
MBE=∑Ni=1(Hpi−Hi)N | (3) |
Table 4 compares the values of RMSE of the best two reference models and the new model that consists of parallel multi-functions.
Operators probability | Best fitness in last population | RMSE | MBE |
Operators prob.: [0.85, 0.15] Sampling: roulette Model: reference model |
4.402 | 1.862 | 1.322 |
Operators prob. : dynamic Sampling: tournament Model: reference model |
4.174 | 1.026 | 0.68 |
Operators prob. : dynamic Sampling: tournament Model: Parallel |
1.218 | 0.210 | 0.052 |
Figure 7 and Figure 8 show the measured and estimated values of monthly average daily global solar irradiance for the best reference model and the multi-model. The later shows better performance in estimating monthly average daily global solar irradiance.
Figure 9 shows the binary tree that represents one of the best functions in the multi-model approach. Equation (4) represents the function of the binary tree shown in this Figure.
sr=√sh∗humtmpsh+log(hum)sh−tmp+wstmp+sh | (4) |
The suggested genetic programming based approaches show comparable performance with respect to other empirical regression and neural models. Table 5 compares the RMSE of the results obtained by the suggested approach to those obtained by other models conducted by other groups.
Model | RMSE |
Quadratic Regression model | 0.214 |
Logarithmic Regression model | 0.259 |
ANN 3-10-4-1 | 0.374 |
ANN 3-20-8-1 | 0.391 |
GP model with parallel functions | 0.210 |
This article described new approaches for estimating global solar irradiance using meteorological records. The suggested methods are based on a GP heuristic technique. The first method (reference system) consists of estimating the nonlinear function that can model the relation between solar irradiance and four meteorological parameters. The performance of the reference system is promising. An enhanced model that consists of multi-function was proposed and it showed better performance with respect to the first method. The performance of the proposed approaches was evaluated using statistical analysis measures.
The GP showed its advantages in dynamically building complex formulas that represent solution candidates for the problem to be solved. As an evolutionary process, the GP shows its ability to resolve the local minima problem and to converge toward a global minima. The problem of local minima is commonly found in the neural networks models especially in their feed-forward structures commonly used in the literature coupled with the classical back-propagation training algorithm. Moreover, the GP technique provides analytical expressions as solutions, like the expression given in Equation (3), which is not available in most of the machine learning techniques, for instance the neural and the neuro-fuzzy models. The later property is important for researchers to understand the contribution of each variable input in the calculation of the dependent variable output.
In our experiments, we controlled the well-known bloat (inflation) problem of GP by using a set of techniques provided by the GPLAB environment [44]. Some of those techniques consist of automatic resizing of the population in runtime to save computational resources. Those techniques are adequate in cases of complex problems when the complexity of the expressions' models increases dramatically during the evolutionary process.
A similar approach to our enhanced one has been proposed in [35] with two main differences: First, in that approach the input data has been pre-analyzed by using an optimization technique to handle the multicollinearity problem that may exist among the variables, which is not available in our approach that has been applied on data records of four meteorological variables. Second, the approach in [35] consists of a set of twelve models, each model is dedicated to estimate the solar radiation in a specific month of the year whereas our approach consists of six models. Each of our parallel models is devoted to estimate the solar radiation for a semi-seasonal period of the year. In general, the increase of number of learning-based models requires more training data records to adjust those models during the design phase which may not be always possible. One of our future suggestions is to automate the splitting of data into seasonal subsets based on an automatic learning technique in order to optimize the estimation performance.
In this work, the proposed GP approach is not compared to other learning based estimation techniques of the same type. Such comparison needs comparing the convergence time of each approach as well as the performance of estimation using the same data sets and same leaning parameters. This could be one of our future perspectives. On the other hand, the obtained results in this work are comparable to those obtained by mathematical regressions and neural models that were conducted by other research groups. Finally, the obtained results showed the advantage of using the parallel modular structure over the global one.
Three main future perspectives can be drawn for this work. The first one consists of finding a way to automatically splitting the data into seasonal or semi seasonal periods in order to optimize the performance of estimation. The second perspective consists of comparing or GP based approach with other machine learning based techniques by using the same data sets. The third perspective consists of validating the proposed approach using new meteorological datasets with larger number of weather parameters in each record.
The authors would like to thank the National Center of Meteorology and Seismology (NCMS), Abu Dhabi for providing the weather data.
All authors declare on conflicts of interest in this paper.
[1] |
J. M. Langley, M. E. Faughnan, Prevention of influenza in the general population, Can. Med. Assoc. J., 171 (2004), 1213-1222. doi: 10.1503/cmaj.1031737
![]() |
[2] |
W. W. Thompson, D. K. Shay, E. Weintraub, L. Brammer, C. B. Bridges, et al., Influenza-associated hospitalizations in the United States, J. Am. Med. Assoc., 292 (2004), 1333-1340. doi: 10.1001/jama.292.11.1333
![]() |
[3] |
J. K. Taubenberger, D. M. Morens, The pathology of influenza virus infections, Annu. Rev. Pathol. Mech. Dis., 3 (2008), 499-522. doi: 10.1146/annurev.pathmechdis.3.121806.154316
![]() |
[4] |
A. Nagy, L. Černíková, V. Křivda, J. Horníčková, Digital genotyping of avian influenza viruses of H7 subtype detected in central Europe in 2007-2011, Virus Res., 165 (2012), 126-133. doi: 10.1016/j.virusres.2012.02.005
![]() |
[5] | Q Li, L Zhou, M Zhou, Z Chen, F Li, H Wu, et al., Preliminary report: Epidemiology of the avian influenza A (H7N9) outbreak in China, N. Engl. J. Med., 24 (2013), xi-xii. |
[6] |
Y. Hu, S. Lu, Z. Song, W. Wang, P. Hao, J. Li, et al., Association between adverse clinical outcome in human disease caused by novel influenza A H7N9 virus and sustained viral shedding and emergence of antiviral resistance, Lancet, 381 (2013), 2273-2279. doi: 10.1016/S0140-6736(13)61125-3
![]() |
[7] |
G. Neumann, T. Noda, Y. Kawaoka, Emergence and pandemic potential of swine-origin H1N1 influenza virus, Nature, 459 (2009), 931-939. doi: 10.1038/nature08157
![]() |
[8] | G. Lu, K. Buyyani, N. Goty, R. Donis, Z. Chen, Influenza a virus informatics: Genotype-centered database and genotype annotation, Second International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007), 2007. Available from: https://ieeexplore.ieee.org/abstract/document/4392583. |
[9] |
A. Melidou, G. Gioula, M. Exindari, D. Chatzidimitriou, E. Diza, N. Malisiovas, Molecular and phylogenetic analysis of the haemagglutinin gene of pandemic influenza H1N1 2009 viruses associated with severe and fatal infections, Virus Res., 151 (2010), 192-199. doi: 10.1016/j.virusres.2010.05.005
![]() |
[10] |
E. D. Kilbourne, Influenza pandemics of the 20th century, Emerg. Infect. Dis., 12 (2006), 9. doi: 10.3201/eid1201.051254
![]() |
[11] | W. H. Organization, Ten things you need to know about pandemic influenza (update of 14 October 2005), Wkly. Epidemiol. Rec., 80 (2005), 428-431. |
[12] |
D. J. Smith, A. S. Lapedes, J. C. de Jong, T. M. Bestebrore, G. F. Rimmelzwaan, A. D. Osterhaus, et al., Mapping the antigenic and genetic evolution of influenza virus, Science, 305 (2004), 371-376. doi: 10.1126/science.1097211
![]() |
[13] |
J. K. Taubenberger, D. M. Morens, 1918 Influenza: the mother of all pandemics, Rev. Biomed., 17 (2006), 69-79. doi: 10.32776/revbiomed.v17i1.440
![]() |
[14] | A Patient, Swine influenza A (H1N1) infection in two children-Southern California, March-April 2009, Morb. Mortal. Wkly. Rep., 58 (2009), 400-402. |
[15] |
M. P. Girard, J. S. Tam, O. M. Assossou, M. P. Kieny, The 2009 A (H1N1) influenza virus pandemic: A review, Vaccine, 28 (2010), 4895-4902. doi: 10.1016/j.vaccine.2010.05.031
![]() |
[16] | E. Golemis, Protein-protein interactions: A molecular cloning manual, CSHL Press, (2005). |
[17] |
C. D. Hu, Y. Chinenov, T. K. Kerppola, Visualization of interactions among bZIP and Rel family proteins in living cells using bimolecular fluorescence complementation, Mol. Cell, 9 (2002), 789-798. doi: 10.1016/S1097-2765(02)00496-3
![]() |
[18] |
E. Sprinzak, H. Margalit, Correlated sequence-signatures as markers of protein-protein interaction11Edited by G. von Heijne, J. Mol. Biol., 311 (2001), 681-692. doi: 10.1006/jmbi.2001.4920
![]() |
[19] | W. K. Kim, J. Park, J. K. Suh, Large scale statistical prediction of protein-protein interaction by potentially interacting domain (PID) pair., Genome Inform., 13 (2002) 42-50. |
[20] |
S. K. Ng, Z. Zhang, S. H. Tan, Integrative approach for computationally inferring protein domain interactions, Bioinformatics, 19 (2003), 923-929. doi: 10.1093/bioinformatics/btg118
![]() |
[21] |
H. Yu, N. M. Luscombe, H. X. Lu, X. Zhu, Y. Xia, J. D. Han, et al., Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs, Genome Res., 14 (2004), 1107-1118. doi: 10.1101/gr.1774904
![]() |
[22] |
L. V Zhang, S. L. Wong, O. D. King, F. P. Roth, Predicting co-complexed protein pairs using genomic and proteomic data integration, BMC Bioinformatics, 5 (2004), 38. doi: 10.1186/1471-2105-5-38
![]() |
[23] |
R. Jansen, H. Yu, D. Greenbaum, Y. Kluger, N. J. Krogan, S. Chung, et al., A Bayesian networks approach for predicting protein-protein interactions from genomic data, Science, 302 (2003), 449-453. doi: 10.1126/science.1087361
![]() |
[24] |
Y. Qi, Z. Bar‐Joseph, J. Klein‐Seetharaman, Evaluation of different biological data and computational classification methods for use in protein interaction prediction, Proteins Struct. Funct. Bioinforma., 63 (2006), 490-500. doi: 10.1002/prot.20865
![]() |
[25] |
M. D. Dyer, T. M. Murali, B. W. Sobral, Computational prediction of host-pathogen protein-protein interactions, Bioinformatics, 23 (2007), 159-166. doi: 10.1093/bioinformatics/btm208
![]() |
[26] |
A. Emamjomeh, B. Goliaei, J. Zahiri, R. Ebrahimpour, Predicting protein-protein interactions between human and hepatitis C virus via an ensemble learning method, Mol Biosyst, 10 (2014), 3147-3154. doi: 10.1039/C4MB00410H
![]() |
[27] | O. Tastan, Y. Qi, J. G. Carbonell, J. Klein-Seetharaman, Prediction of interactions between HIV-1 and human proteins by information integration, in Biocomputing, World Scientific, (2009), 516-527. |
[28] |
Y. Qi, O. Tastan, J. G. Carbonell, J. Klein-Seetharaman, J. Weston, Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins, Bioinformatics, 26 (2010), i645-i652. doi: 10.1093/bioinformatics/btq394
![]() |
[29] | B. Barnes, M. Karimloo, A. Schoenrock, D. Burnside, E. Cassol, A. Wong, et al., Predicting novel protein-protein interactions between the HIV-1 virus and homo sapiens, 2016 IEEE EMBS International Student Conference (ISC), 2016. Available from: https://ieeexplore.ieee.org/abstract/document/7508598/. |
[30] | S. Alguwaizani, B. Park, X. Zhou, D. S. Huang, K. Han, Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids, J. Healthc. Eng., 2018 (2018). |
[31] | N. Zhang, M. Jiang, T. Huang, Y. D. Cai, Identification of Influenza A/H7N9 virus infection-related human genes based on shortest paths in a virus-human protein interaction network, Biomed. Res. Int., 2014 (2014). |
[32] |
Gene Ontology Consortium, The Gene Ontology (GO) database and informatics resource, Nucleic Acids Res., 32 (2004), D258-D261. doi: 10.1093/nar/gkh036
![]() |
[33] | C. L. P. Eng, J. C. Tong, T. W. Tan, Predicting host tropism of influenza A virus proteins using random forest, BMC Med. Genomics, 7 (2014), S1. |
[34] | L. Nanni, A. Lumini, S. Brahnam, An Empirical Study of Different Approaches for Protein Classification, Sci. World J., 2014 (2014), 236717. |
[35] |
E. I. Zacharaki, Prediction of protein function using a deep convolutional neural network ensemble, PeerJ Comput. Sci., 3 (2017), e124. doi: 10.7717/peerj-cs.124
![]() |
[36] |
I. Saha, J. Zubek, T. Klingstrom, S. Forsberg, J. Wikander, M. Kierczak, et al., Ensemble learning prediction of protein-protein interactions using proteins functional annotations, Mol. Biosyst., 10 (2014), 820-830. doi: 10.1039/c3mb70486f
![]() |
[37] | L. Nanni, S. Brahnam, S. Ghidoni, A. Lumini, Toward a general-purpose heterogeneous ensemble for pattern classification, Comput. Intell. Neurosci., 2015 (2015). |
[38] | S. Kerrien, B. Aranda, L. Breuza, A. Bridge, F. Broackes-Carter, C. Chen, et al., The IntAct molecular interaction database in 2012, Nucleic Acids Res., 40 (2011), D841-D846. |
[39] |
A. Chatr-aryamontri, A. Ceol, D. Peluso, A. Nardozza, S. Panni, F. Sacco, et al., VirusMINT: A viral protein interaction database, Nucleic Acids Res., 37 (2009), D669-D673. doi: 10.1093/nar/gkn739
![]() |
[40] |
I. Xenarios, L. Salwinski, X. J. Duan, P. Higney, S. M. Kim, D. Eisenberg, DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions, Nucleic Acids Res., 30 (2002), 303-305. doi: 10.1093/nar/30.1.303
![]() |
[41] | D. Szklarczyk, A. Franceschini, S. Wyder, K. Forslund, D. Heller, J. Huerta-Cepas, et al., STRING v10: protein-protein interaction networks, integrated over the tree of life, Nucleic Acids Res., 43 (2014), D447-D452. |
[42] |
C. Stark, B. J. Breitkreutz, T. Reguly, L. Boucher, A. Breitkreutz, M. Tyers, BioGRID: A general repository for interaction datasets, Nucleic Acids Res., 34 (2006), D535-D539. doi: 10.1093/nar/gkj109
![]() |
[43] |
J. Yu, M. Guo, C. J. Needham, Y. Huang, L. Cai, D. R. Westhead, Simple sequence-based kernels do not predict protein-protein interactions, Bioinformatics, 26 (2010), 2610-2614. doi: 10.1093/bioinformatics/btq483
![]() |
[44] |
Y. Huang, B. Niu, Y. Gao, L. Fu, W. Li, CD-HIT Suite: A web server for clustering and comparing biological sequences, Bioinformatics, 26 (2010), 680-682. doi: 10.1093/bioinformatics/btq003
![]() |
[45] |
M. A. Tahir, J. Kittler, F. Yan, Inverse random under sampling for class imbalance problem and its application to multi-label classification, Pattern Recognit., 45 (2012), 3738-3750. doi: 10.1016/j.patcog.2012.03.014
![]() |
[46] |
S. Kawashima, P. Pokarowski, M. Pokarowska, A. Kolinski, T. Katayama, M. Kanehisa, AAindex: Amino acid index database, progress report 2008, Nucleic Acids Res, 36 (2008) D202-205. doi: 10.1093/nar/gkn255
![]() |
[47] | R. Bellman, R. Corporation, Dynamic Programming, Princeton University Press, (1957). |
[48] |
A. Wagner, Energy constraints on the evolution of gene expression, Mol. Biol. Evol., 22 (2005), 1365-1374. doi: 10.1093/molbev/msi126
![]() |
[49] |
P. M. Sharp, T. M. Tuohy, K. R. Mosurski, Codon usage in yeast: Cluster analysis clearly differentiates highly and lowly expressed genes., Nucleic Acids Res., 14 (1986), 5125-5143. doi: 10.1093/nar/14.13.5125
![]() |
[50] |
P. M. Sharp, W. H. Li, The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications, Nucleic Acids Res., 15 (1987), 1281-1295. doi: 10.1093/nar/15.3.1281
![]() |
[51] |
J. SantaLucia, A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics, Proc. Natl. Acad. Sci., 95 (1998), 1460-1465. doi: 10.1073/pnas.95.4.1460
![]() |
[52] |
P. Claverie, Calculation of interaction energy between triplets in the RNA 11 configuration, J. Mol. Biol., 56 (1971), 75-82. doi: 10.1016/0022-2836(71)90085-4
![]() |
[53] |
B. Lee, F. M. Richards, The interpretation of protein structures: estimation of static accessibility, J. Mol. Biol., 55 (1971), 379-IN4. doi: 10.1016/0022-2836(71)90324-X
![]() |
[54] |
P. Klein, M. Kanehisa, C. DeLisi, Prediction of protein function from sequence properties: Discriminant analysis of a data base, Biochim. Biophys. Acta, Protein Struct. Mol. Enzymol., 787 (1984) 221-226. doi: 10.1016/0167-4838(84)90312-1
![]() |
[55] |
Y. Guo, L. Yu, Z. Wen, M. Li, Using support vector machine combined with auto covariance to predict protein-protein interactions from protein sequences, Nucleic Acids Res., 36 (2008), 3025-3030. doi: 10.1093/nar/gkn159
![]() |
[56] |
X. Wu, E. Pang, K. Lin, Z. M. Pei, Improving the Measurement of Semantic Similarity between Gene Ontology Terms and Gene Products: Insights from an Edge- and IC-Based Hybrid Method, PLoS One, 8 (2013), e66745. doi: 10.1371/journal.pone.0066745
![]() |
[57] |
Y. R. Cho, W. Hwang, M. Ramanathan, A. Zhang, Semantic integration to identify overlapping functional modules in protein interaction networks, BMC Bioinformatics, 8 (2007) 265. doi: 10.1186/1471-2105-8-265
![]() |
[58] | P. Resnik, Using information content to evaluate semantic similarity in a taxonomy, arXiv Prepr. C., 1995 (1995). |
[59] | J. J. Jiang, D. W. Conrath, Semantic similarity based on corpus statistics and lexical taxonomy, arXiv Prepr. C., 1997 (1997). |
[60] | D. Lin, An information-theoretic definition of similarity, Icml, 98 (1998) 296-304. |
[61] |
D. H. Wolpert, Stacked Generalization, Neural Networks, 5 (1992), 241-259. doi: 10.1016/S0893-6080(05)80023-1
![]() |
[62] | B. Khorsand, EvaluationMeasures: Collection of Model Evaluation Measure Functions, CRAN, 2016 (2016). |
[63] |
D. W. Huang, B. T. Sherman, R. A. Lempicki, Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources, Nat. Protoc., 4 (2009), 44-57. doi: 10.1038/nprot.2008.211
![]() |
[64] |
F. Supek, M. Bošnjak, N. Škunca, T. Šmuc, REVIGO summarizes and visualizes long lists of gene ontology terms, PLoS One, 6 (2011), e21800. doi: 10.1371/journal.pone.0021800
![]() |
[65] |
H. Hotelling, Analysis of a complex of statistical variables into principal components, J. Educ. Psychol., 24 (1933), 417. doi: 10.1037/h0071325
![]() |
[66] |
R. Leardi, A. L. Gonzalez, Genetic algorithms applied to feature selection in PLS regression: how and when to use them, Chemom. Intell. Lab. Syst., 41 (1998), 195-207. doi: 10.1016/S0169-7439(98)00051-3
![]() |
1. | Rami Al-Hajj, Ali Assi, Mohamad M. Fouad, 2019, Stacking-Based Ensemble of Support Vector Regressors for One-Day Ahead Solar Irradiance Prediction, 978-1-7281-3587-8, 428, 10.1109/ICRERA47325.2019.8996629 | |
2. | Rami Al-Hajj, Ali Assi, Mohamad Fouad, Short-Term Prediction of Global Solar Radiation Energy Using Weather Data and Machine Learning Ensembles: A Comparative Study, 2021, 143, 0199-6231, 10.1115/1.4049624 | |
3. | Mohamad M. Fouad, Ali Ibrahim El-Desouky, Rami Al-Hajj, El-Sayed M. El-Kenawy, Dynamic Group-Based Cooperative Optimization Algorithm, 2020, 8, 2169-3536, 148378, 10.1109/ACCESS.2020.3015892 | |
4. | Abdelaziz Rabehi, Mawloud Guermoui, Djemoui Lalmi, Hybrid models for global solar radiation prediction: a case study, 2020, 41, 0143-0750, 31, 10.1080/01430750.2018.1443498 | |
5. | Rami Al-Hajj, Ali Assi, Mohamad M. Fouad, 2018, Forecasting Solar Radiation Strength Using Machine Learning Ensemble, 978-1-5386-5982-3, 184, 10.1109/ICRERA.2018.8567020 | |
6. | Rami Al-Hajj, Ali Assi, Mohamad Fouad, Emad Mabrouk, A Hybrid LSTM-Based Genetic Programming Approach for Short-Term Prediction of Global Solar Radiation Using Weather Data, 2021, 9, 2227-9717, 1187, 10.3390/pr9071187 | |
7. | El-Sayed M. El-kenawy, Abdelaziz A. Abdelhamid, Abdelhameed Ibrahim, Seyedali Mirjalili, Nima Khodadad, Mona A. Al duailij, Amel Ali Alhussan, Doaa Sami Khafaga, Al-Biruni Earth Radius (BER) Metaheuristic Search Optimization Algorithm, 2023, 45, 0267-6192, 1917, 10.32604/csse.2023.032497 | |
8. | Shivani Sehgal, Aman Ganesh, Vikram Kumar Kamboj, O. P. Malik, A Memetic Approach to Multi-Disciplinary Design and Numerical Optimization Problems using Intensify Slime Mould Optimizer, 2024, 0924-669X, 10.1007/s10489-023-05073-7 | |
9. | Harleenpal Singh, Sobhit Saxena, Himanshu Sharma, Vikram Kumar Kamboj, Krishan Arora, Gyanendra Prasad Joshi, Woong Cho, An integrative TLBO-driven hybrid grey wolf optimizer for the efficient resolution of multi-dimensional, nonlinear engineering problems, 2025, 15, 2045-2322, 10.1038/s41598-025-89458-3 |
Max Temperature | Mean wind Speed (knot) | Sun Hours | Mean Relative Humidity % | Total Radiation (kWh/sq.m) |
25.7 | 5.6 | 8.9 | 68 | 462.8 |
24.5 | 7.1 | 9.6 | 67 | 459.3 |
23.0 | 5.4 | 10.2 | 54 | 515.5 |
25.1 | 6.4 | 10.1 | 47 | 505.0 |
23.6 | 8.3 | 1.6 | 63 | 230.6 |
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 4.402 |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Tournament | {arithmetic, √, log10} | 4.491 |
Dynamic | Roulette | {arithmetic, √, log10} | 4.503 |
Dynamic | Tournament | {arithmetic, √, log10} | 4.174 |
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 2.137 |
Dynamic | Tournament | {arithmetic, √, log10} | 1.218 |
Operators probability | Best fitness in last population | RMSE | MBE |
Operators prob.: [0.85, 0.15] Sampling: roulette Model: reference model |
4.402 | 1.862 | 1.322 |
Operators prob. : dynamic Sampling: tournament Model: reference model |
4.174 | 1.026 | 0.68 |
Operators prob. : dynamic Sampling: tournament Model: Parallel |
1.218 | 0.210 | 0.052 |
Model | RMSE |
Quadratic Regression model | 0.214 |
Logarithmic Regression model | 0.259 |
ANN 3-10-4-1 | 0.374 |
ANN 3-20-8-1 | 0.391 |
GP model with parallel functions | 0.210 |
Max Temperature | Mean wind Speed (knot) | Sun Hours | Mean Relative Humidity % | Total Radiation (kWh/sq.m) |
25.7 | 5.6 | 8.9 | 68 | 462.8 |
24.5 | 7.1 | 9.6 | 67 | 459.3 |
23.0 | 5.4 | 10.2 | 54 | 515.5 |
25.1 | 6.4 | 10.1 | 47 | 505.0 |
23.6 | 8.3 | 1.6 | 63 | 230.6 |
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 4.402 |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Tournament | {arithmetic, √, log10} | 4.491 |
Dynamic | Roulette | {arithmetic, √, log10} | 4.503 |
Dynamic | Tournament | {arithmetic, √, log10} | 4.174 |
Operators probability | Sampling method | Set of functions | Best fitness in last population |
Crossover prob. : 0.85 Mutation prob. : 0.15 | Roulette | {arithmetic, √, log10} | 2.137 |
Dynamic | Tournament | {arithmetic, √, log10} | 1.218 |
Operators probability | Best fitness in last population | RMSE | MBE |
Operators prob.: [0.85, 0.15] Sampling: roulette Model: reference model |
4.402 | 1.862 | 1.322 |
Operators prob. : dynamic Sampling: tournament Model: reference model |
4.174 | 1.026 | 0.68 |
Operators prob. : dynamic Sampling: tournament Model: Parallel |
1.218 | 0.210 | 0.052 |
Model | RMSE |
Quadratic Regression model | 0.214 |
Logarithmic Regression model | 0.259 |
ANN 3-10-4-1 | 0.374 |
ANN 3-20-8-1 | 0.391 |
GP model with parallel functions | 0.210 |