
I explore the effect that albedo has on the amount of solar energy collected as well as the effectiveness of various building materials used in Jordan's varied construction industry. Albedo, which is the reflection of incoming radiation by surfaces, is of crucial relevance in minimizing the damage that solar radiation may do to building components like windows. The construction sector in Jordan is a substantial contributor to the overall level of energy consumption in the country because of the wide variety of building types, purposes and technologies found there. The findings of this research show that substances with greater albedo values produce the best results in terms of energy gains. These findings are supported by in-depth albedo value analyses and provide useful insights that may be used to improve building design and construction methods in Jordan, which will eventually lead to increased energy efficiency and sustainability within the construction sector.
Citation: Saad S. Alrwashdeh. The Effect of Environmental Albedo on the Energy Use of a Selected House in Amman-Jordan[J]. AIMS Environmental Science, 2023, 10(5): 628-643. doi: 10.3934/environsci.2023035
[1] | Huiqing Wang, Jiale Han, Haolin Li, Liguo Duan, Zhihao Liu, Hao Cheng . CDA-SKAG: Predicting circRNA-disease associations using similarity kernel fusion and an attention-enhancing graph autoencoder. Mathematical Biosciences and Engineering, 2023, 20(5): 7957-7980. doi: 10.3934/mbe.2023345 |
[2] | Jie Qiu, Maolin Sun, Chuanshan Zang, Liwei Jiang, Zuorong Qin, Yan Sun, Mingbo Liu, Wenwei Zhang . Five genes involved in circular RNA-associated competitive endogenous RNA network correlates with metastasis in papillary thyroid carcinoma. Mathematical Biosciences and Engineering, 2021, 18(6): 9016-9032. doi: 10.3934/mbe.2021444 |
[3] | Pingping Song, Jing Chen, Xu Zhang, Xiaofeng Yin . Construction of competitive endogenous RNA network related to circular RNA and prognostic nomogram model in lung adenocarcinoma. Mathematical Biosciences and Engineering, 2021, 18(6): 9806-9821. doi: 10.3934/mbe.2021481 |
[4] | Saranya Muniyappan, Arockia Xavier Annie Rayan, Geetha Thekkumpurath Varrieth . DTiGNN: Learning drug-target embedding from a heterogeneous biological network based on a two-level attention-based graph neural network. Mathematical Biosciences and Engineering, 2023, 20(5): 9530-9571. doi: 10.3934/mbe.2023419 |
[5] | Zejun Li, Yuxiang Zhang, Yuting Bai, Xiaohui Xie, Lijun Zeng . IMC-MDA: Prediction of miRNA-disease association based on induction matrix completion. Mathematical Biosciences and Engineering, 2023, 20(6): 10659-10674. doi: 10.3934/mbe.2023471 |
[6] | Huiqing Wang, Sen Zhao, Jing Zhao, Zhipeng Feng . A model for predicting drug-disease associations based on dense convolutional attention network. Mathematical Biosciences and Engineering, 2021, 18(6): 7419-7439. doi: 10.3934/mbe.2021367 |
[7] | Lei Chen, Kaiyu Chen, Bo Zhou . Inferring drug-disease associations by a deep analysis on drug and disease networks. Mathematical Biosciences and Engineering, 2023, 20(8): 14136-14157. doi: 10.3934/mbe.2023632 |
[8] | Meng Jiang, Bo Zhou, Lei Chen . Identification of drug side effects with a path-based method. Mathematical Biosciences and Engineering, 2022, 19(6): 5754-5771. doi: 10.3934/mbe.2022269 |
[9] | Maoxing Liu, Jie Zhang, Zhengguang Li, Yongzheng Sun . Modeling epidemic in metapopulation networks with heterogeneous diffusion rates. Mathematical Biosciences and Engineering, 2019, 16(6): 7085-7097. doi: 10.3934/mbe.2019355 |
[10] | Sonza Singh, Anne Marie France, Yao-Hsuan Chen, Paul G. Farnham, Alexandra M. Oster, Chaitra Gopalappa . Progression and transmission of HIV (PATH 4.0)-A new agent-based evolving network simulation for modeling HIV transmission clusters. Mathematical Biosciences and Engineering, 2021, 18(3): 2150-2181. doi: 10.3934/mbe.2021109 |
I explore the effect that albedo has on the amount of solar energy collected as well as the effectiveness of various building materials used in Jordan's varied construction industry. Albedo, which is the reflection of incoming radiation by surfaces, is of crucial relevance in minimizing the damage that solar radiation may do to building components like windows. The construction sector in Jordan is a substantial contributor to the overall level of energy consumption in the country because of the wide variety of building types, purposes and technologies found there. The findings of this research show that substances with greater albedo values produce the best results in terms of energy gains. These findings are supported by in-depth albedo value analyses and provide useful insights that may be used to improve building design and construction methods in Jordan, which will eventually lead to increased energy efficiency and sustainability within the construction sector.
Circular RNA (circRNA) is a special type of endogenous non-coding RNA. Unlike linear RNAs, circRNAs are produced through the process of back splicing, and their 3' and 5' ends are connected by either exon or intron circularization to form a covalently closed continuous loop structure. In 1976, Sanger et al. first discovered circRNA in viroid and Sendai virus particles of infected plants by electron microscopy and other techniques [1]. In 1979, Hsu et al. observed the presence of circRNA in the cytoplasm of eukaryotic cells through electron microscopy [2]. After that, more and more circRNAs were discovered in eukaryotic cells and fungal cells [3,4,5,6]. Increasing evidence shows that circRNAs are widely distributed in animals and plants, and they play important roles in many biological processes. CircRNAs can function as miRNA sponges [7], regulators of RNA-binding proteins [8] and parental gene transcription [9]. In recent years, circRNA has also been used as a new clinical diagnostic marker and a potential target for human disease treatment [10,11,12,13]. For example, Xu et al. found that circRNA Cdrlas is a therapeutic target for diabetes, which affects insulin secretion in islet cells by tricking miR-7 [10]. Cui et al. reported that hsa_circRNA_103636 is a potential new biomarker for depression [12]. Lu et al. found that hsa_circ_0063425 and hsa_circ_0056891 are biomarkers for the early stages of type 2 diabetes [13]. The aforementioned facts indicate that some circRNAs have strong associations with certain diseases. On the one hand, investigations on such associations are helpful to elucidate the mechanisms of circRNA functional roles; on the other hand, such investigations explore the pathogenesis of complex diseases. Thus, these investigations improve disease diagnoses and promote disease-targeted therapies.
To date, only a few circRNA-disease associations (CDAs) have been determined. These limited associations have become a hinderance to further studying mechanisms of circRNA functional roles and uncover the pathogenesis of diseases. Evidently, the determination of numerous CDAs can improve our understanding on circRNAs and diseases. Traditional clinical trials can provide a solid determination of CDAs; however, they always need lots of time and are very expensive. It is urgent to design quick and cheap methods to detect CDAs. Computational methods are deemed as suitable alternative tools to identify latent CDAs. Generally, abundant information on circRNAs and diseases is needed for building the computational methods. Fortunately, in recent years, some public databases, such as Circ2Disease [14], CircR2Disease [15], etc., have been set up, which are facilitated to discover hidden CDAs. On the other hand, the recent development of computational methods [16,17,18,19], especially various machine learning methods, provides more powerful technical support. These methods can deeply analyze the validated data and infer reliable and novel knowledge. In the field of CDA prediction, some computational methods have been proposed. These methods can be roughly divided into three groups: network-based methods [20,21,22,23], machine learning-based methods [24,25,26,27,28,29] and recommendation systems [30,31]. These previous methods are introduced in Section 2. Although current methods provide a high performance for the prediction of CDAs, there still exists spaces for improvement. For example, the feature representation of CDAs is far from perfect, thus decreasing the prediction quality of machine learning-based methods.
In this study, we combined the network-based and machine learning-based methods to develop a powerful model, named PCDA-HNMP, to identify CDAs, which can be deemed as the continued work of [25]. The main idea of PCDA-HNMP was to extract informative features from constructed networks, which were learnt by the downstream classification algorithm to generate efficient classification patterns. The contributions of this study were as below:
1) We proposed a prediction model to identify novel CDAs; the model was named PCDA-HNMP.
2) A heterogeneous network containing circRNAs, diseases, miRNAs and mRNAs was constructed using the currently known associations between above objects. Furthermore, the meta-paths for connecting circRNAs (diseases) were extracted from this heterogeneous network to further mine hidden associations between various circRNAs (diseases), inducing the networks for circRNAs (diseases). The powerful network embedding algorithm, mashup [32], was applied to the above networks to generate informative circRNA (disease) features.
3) To improve the learning quality, miRNA-disease associations (mDAs) were employed in PCDA-HNMP to help predict CDAs. The informative miRNAs features were obtained from bipartite networks of miRNAs and circRNAs, which were constructed based on the meta-paths connecting circRNAs and miRNAs in the heterogeneous network.
4) The features of circRNAs and diseases were combined to represent CDAs; mDAs were represented in a similar manner. PCDA-HNMP adopted the powerful classification algorithm, XGBoost (XGB) [33], to understand classification patterns from the aforementioned representations of CDAs and mDAs.
5) The five-fold cross-validation of the high performance PCDA-HNMP had an area under the curve (AUC) of 0.9846, which was better than those yielded by some existing state-of-the-art methods. Further tests showed that the employment of mDAs enhanced the model's performance; networks yielded by meta-paths containing validated CDAs provided the most important contributions to the PCDA-HNMP.
In this study, we investigated the problem of CDA prediction. To our knowledge, several methods have been proposed in this field, which can be roughly divided into three groups. Here, the brief descriptions on these methods were provided.
The network-based methods always construct networks for circRNAs, diseases or both and employ powerful network algorithms to build the methods. Two methods constructed a heterogeneous network defining circRNAs and diseases as nodes [21,23]. The edges in this network were determined by circRNA associations, disease associations and validated CDAs. Based on this heterogeneous network, the methods assigned a score to each pair of circRNAs and diseases used either KATZ [21] method or path weighted algorithms [23]. A high score indicates that the corresponding pair can be a latent CDA. Deng et al. improved the aforementioned method by employing inferred CDAs, which were obtained by validated circRNA-protein and protein-disease associations, to the heterogeneous network [22]. The KATZ method was used to measure the strength of the associations between circRNAs and diseases. The last network-based method was proposed by Li et al. [20], which was designed in a different way. Based on the validated CDAs, a bipartite network regarding circRNAs and diseases was constructed, which was fed into the DeepWalk software to yield the circRNA and disease features. The disease and circRNA topological similarities were calculated based on their features. Network consistency projection was applied to the similarity networks on circRNAs and diseases and the aforementioned bipartite network to evaluate the associations of circRNAs and diseases.
Some methods in this group adopted traditional machine learning algorithms to build the methods. First, they extracted informative features for circRNAs and diseases, then fed them into classification algorithms to learn the classification patterns. Zhang et al. proposed the iCDA-CGR to identify CDAs [24]. First, it constructed multiple similarity matrices for either circRNAs or diseases based on circRNA sequences, circRNA-gene associations, disease semantic and validated CDA information and combined them into one similarity matrix for circRNAs (diseases). The similarity scores in the matrix were picked up as the features of circRNAs (diseases), which were fed into a support vector machine to build the iCDA-CGR. Kouhsar et al. proposed the CircWalk method, which adopted a different scheme to generate the features of circRNAs (diseases) [25]. It constructed a heterogeneous network containing circRNAs, diseases, miRNAs and mRNAs and employed DeepWalk to generate the features of circRNAs (diseases) from this network. The XGB was applied to these features to understand classification patterns.
With the successful applications of deep learning algorithms, some methods to identify CADs adopted deep learning algorithms. The functions of deep learning in these methods were either to extract high-level features or to learn deep patterns for classification. GCNCDA, which was designed by Wang et al., used similarity scores as features of circRNAs (diseases) [26]. These raw features were refined by the fast learning with graph convolutional networks (FastGCN) algorithm and the output features were fed into the forest by penalizing attributes (Forest PA) algorithm to make a prediction. CDASOR employed the k-mers embedding in circRNA sequences to represent circRNAs [27], and were refined by a 1-D convolutional neural network (CNN) and bi-directional long short-term memory (BiLSTM). As for disease representation, it adopted an ontology embedding. A fully connected layer was applied to the above representation for making a prediction. Deng et al. presented the MSPCD method to identify CDAs [28]. The association scores were collected as the raw features of circRNAs (diseases), which were used to extract high-order features by fully connected layers. Finally, the deep neural network (DNN) was adopted to make a prediction. DMFCDA used the association relationship between circRNAs and diseases as the raw features [29], which were refined by three fully connected layers. The refined circRNA and disease features were combined and fed into two fully connected layers to make a prediction.
Recommendation systems always construct one or more kernels for circRNAs and diseases, respectively. These kernels were combined with an adjacency matrix of circRNAs and diseases to yield the recommendation matrix. iCircDA-MF, which was proposed by Wei and Liu, first constructed the circRNA kernel by integrating the CircRNA gene-related similarity and the Gaussian interaction profile (GIP) kernel similarity [30]. For the disease kernel, it integrated the disease semantic similarity and GIP kernel similarity. The adjacency matrix was reformulated by the weighted K nearest known neighbors (WKNKN) algorithm. Finally, it adopted a non-negative matrix factorization and solved an optimization problem to discover the optimal factorization, thereby generating the recommendation matrix. Li et al. used a similar method to build the kernels of circRNAs and diseases, though it solved a different optimization problem to produce the recommendation matrix [31].
An effective dataset is important to build efficient prediction models. Here, the validated CDAs were retrieved from a previous study [25]. These CDAs were collected from four public datasets: Circ2Disease [14], CircR2Disease [15], CTD [34], and CircAtlas [35]. The original dataset in [25] contained 575 validated CDAs, involving 474 circRNAs and 64 diseases. After combining different names of the same disease (e.g., hirschsprung disease and hirschsprung's disease), a total of 61 diseases was investigated in this study. Approximately 571 CDAs related to these diseases were maintained, which still covered 474 circRNAs. We termed 571 CADs as positive samples in this study. From these CADs, a matrix, denoted by Acd, was constructed. Acd(j,k)=1 if and only if the j-th circRNA and the k-th disease comprised an association.
Generally, the negative samples are necessary to construct binary classification models. Additionally, 575 unlabeled pairs of circRNA and diseases used in [25] were employed. After removing four pairs, we obtained unlabeled pairs equivalent to the number of positive samples. In fact, these pairs were obtained by randomly pairing circRNAs and diseases. Although, some pairs may be latent CADs, their probabilities are very low, and the proportion of latent CADs is very small. This random selection of unlabeled pairs is widely used in an association prediction [26,36,37,38]. Accordingly, 571 unlabeled pairs were regarded as negative samples, which were combined with positive samples to constitute the dataset, denoted as DS.
In addition, in recent years, mDA predictions have become a hot topic [39,40,41,42]. Since circRNA and miRNA are two special types of RNAs and several studies have reported their special associations [43,44,45], mDAs may be helpful in predicting CDAs. Thus, for the aforementioned 61 diseases, we extracted their related miRNAs from Circ2Disease [14], HMDD [46], and Mir2Disease [47]. Approximately 912 mDAs were obtained. Additionally, these associations were termed as positive samples. From these mDAs, we also constructed a matrix, denoted by Adi, where Adi(j,k)=1 if and only if the j-th disease and the k-th miRNA constituted an association. Likewise, 912 negative samples for mDAs were also generated by randomly selecting miRNAs and diseases. These positive and negative samples constituted the dataset DSm.
In this study, a new prediction model for CDAs was designed; the model was called PCDA-HNMP. Its construction procedures are illustrated in Figure 1. First, several types of associations between circRNAs, diseases, miRNAs and mRNAs were obtained from some public datasets, and a heterogeneous network containing these associations was constructed. Second, several types of meta-paths were extracted from the above-constructed heterogeneous network, which were further adopted to set up circRNA, disease and circRNA-miRNA networks. Then, the powerful network embedding algorithm, mashup, was employed to extract circRNA, disease and miRNA features. Finally, the feature representations of CDAs in DS and mDAs in DSm were obtained based on above features, which were fed into a binary classification algorithm to train the classification model. In the following sections, the detailed descriptions of each stage are given.
In recent years, the use of multi-sources to design classification models is quite popular. The abundant information behind the multi-sources is helpful to improve the performance of the models. This study attempted to design a model for predicting CDAs. It is known that miRNAs and mRNAs all have special associations with various diseases; the employment of them can help to discover latent CADs. On the other hand, a network is a powerful form, which can organize all objects at a system level. Thus, a heterogeneous network was constructed, which defined circRNAs, diseases, mRNAs and miRNAs as nodes. Its construction was similar to that in [25]. Besides nodes, edges are another component of the network. As there were four types of nodes in the network, we defined seven types of edges, each of which corresponded to one type of association between circRNAs, diseases, mRNAs and miRNAs. The CDAs and mDAs are mentioned in Section 3.1. The remaining five types of associations are described below.
circRNA-circRNA association. The sequence similarity of two circRNAs was adopted to measure their association. The sequences of 474 circRNAs were obtained from CircBase [48]. These sequences were fed into the BioPython package [49] to yield the similarity of any two circRNAs. The average similarity score was set as the threshold to determine the association of two circRNAs. As a result, 46 circRNA-circRNA associations were obtained.
circRNA-miRNA association. This type of association was obtained from RAID [50] and StarBase [51]. Approximately, 1313 associations were contained in the network.
miRNA-mRNA association. This type of association was sourced from miRTarbase [52], Circ2Disease [14], and StarBase [51]. Approximately, 1194 associations were obtained.
mRNA-disease association. DisGeNet is a public database that collects the related genes of various diseases. From this database, the mRNA-disease associations were downloaded. Approximately, 1931 associations were finally used and included in the network.
Disease-disease association. The semantic similarity of any two diseases was calculated based on the tree structure of diseases in the medical subject heading (MeSH) [53] and Wang et al.'s method [54]. After setting the threshold to 0.8, 162 disease-disease associations were obtained.
From the aforementioned five types of associations, we constructed five matrices, denoted by Acc, Aci, Aim, Amd and Add, respectively, using the same method for constructing Acd and Adi. The descriptions of these symbols are listed in Table 1. Furthermore, the aforementioned five types of associations and those mentioned in Section 3.1 were all integrated in the constructed heterogeneous network. This network contained 2532 nodes and 6129 edges. The detailed information of this network is provided in Table 2. For convenience, this network is denoted by HN.
Symbol | Descriptions |
Acd | Association matrix between circRNAs and diseases |
Adi | Association matrix between diseases and miRNAs |
Acc | Association matrix between circRNAs |
Aci | Association matrix between circRNAs and miRNAs |
Aim | Association matrix between miRNAs and mRNAs |
Amd | Association matrix between mRNAs and diseases |
Add | Association matrix between diseases |
Vij | Raw feature vector of the i-th node in the j-th network yielded by RWR algorithm |
Xi | Final feature vector of the i-th node |
Wij | Context feature vector of the i-th node in the j-th network |
Aij | Reconstructed feature vector of the i-th node in the j-th network |
Aijk | The k-th component of the reconstructed feature vector of the i-th node in the j-th network |
Type | Entry | Number |
Nodes | circRNA | 474 |
Disease | 61 | |
miRNA | 632 | |
mRNA | 1365 | |
Association | circRNA←→ disease | 571 |
circRNA←→circRNA | 46 | |
circRNA←→miRNA | 1313 | |
miRNA←→disease | 912 | |
miRNA←→mRNA | 1194 | |
mRNA←→disease | 1931 | |
disease←→disease | 162 |
A meta-path is a widely used concept when constructing models based on networks [55,56,57]. The hidden association information can be extracted from a given network using a meta-path. From the heterogeneous network HN constructed above, we can further infer the circRNA-circRNA associations based on the paths connecting two circRNAs. For example, two circRNAs, c1 and c2, were not directly connected in HN but have a common neighbor, say miRNA mi1. The meta-path c1-mi1-c2 indicated the special association between them. In a similar manner, the inferred associations for other objects can be extracted. By employing these inferred associations, more abundant information for circRNAs, diseases and miRNAs can be obtained, thereby helping construct more efficient classification models.
Based on the length and inner nodes of meta-paths, we extracted several types of meta-paths for circRNAs, diseases and miRNAs. Since the number of meta-paths sharply increases with an increase in path length, we only considered the meta-paths with a length less than three. With these meta-paths, the induced networks for circRNAs, diseases and miRNAs were constructed.
For circRNA, four types of meta-paths were extracted from HN: circRNA-circRNA (meta-path-1); circRNA-circRNA-circRNA (meta-path-2); circRNA-disease-circRNA (meta-path-3); and circRNA-miRNA-circRNA (meta-path-4). The adjacency matrices for the induced networks of the aforementioned meta-paths can be obtained as follows:
{X1=Accfor meta−path−1X2=χZ+(Acc×Acc)for meta−path−2X3=χZ+(Acd×ATcd)X4=χZ+(Aci×ATci)for meta−path−3for meta−path−4 | (1) |
where χZ+(·) is a characteristic function of the positive integer set Z+ on each member in the matrix, and Acc, Acd, and Aci are association matrices between circRNAs, circRNAs and diseases, and circRNAs and miRNAs, respectively (see Table 1 for detailed descriptions).
Likewise, five types of meta-paths were extracted from HN for diseases: disease-disease (meta-path-5); disease-disease-disease (meta-path-6); disease-miRNA-disease (meta-path-7); disease-circRNA-disease (meta-path-8); and disease-mRNA-disease (meta-path-9). The adjacency matrices for the induced networks of above meta-paths can be accessed by the following:
{X5=Addformeta−path−5X6=χZ+(Add×Add)for meta−path−6X7=χZ+(Adi×ATdi)X8=χZ+(ATcd×Acd)X9=χZ+(ATmd×Amd)for meta−path−7for meta−path−8for meta−path−9 | (2) |
where χZ+(·) is a characteristic function of the positive integer set Z+ on each member in the matrix, and Add, Adi, Acd, and Amd are association matrices between diseases, diseases and miRNAs, circRNAs and diseases, and mRNAs and diseases, respectively (see Table 1 for detailed descriptions).
The meta-paths for miRNAs were different from those for circRNA and diseases because miRNA was an aid for the prediction of CDAs. In the training procedure, the role of miRNA was the same as circRNA. The meta-paths for miRNAs should be related to circRNAs. Thus, we adopted the following three types of meta-paths: circRNA-miRNA (meta-path-10); circRNA-circRNA-miRNA (meta-path-11); and circRNA-disease-miRNA (meta-path-12). Similarly, the adjacency matrices of their corresponding induced networks were constructed by the following:
{X10=Acifor meta−path−10X11=χZ+(Acc×Aci)for meta−path−11X12=χZ+(Acd×Adi)for meta−path−12 | (3) |
where χZ+(·) is a characteristic function of the positive integer set Z+ on each member in the matrix, and Aci, Acc, Acd, and Adi are association matrices between circRNAs and miRNAs, circRNAs, circRNAs and diseases, and diseases and miRNAs, respectively (see Table 1 for detailed descriptions).
The meta-path-induced networks included abundant information of circRNAs, diseases and miRNAs. It was challenging to quantify this information and extract the informative features for them. Some network embedding algorithms have been proposed to extract the essential features for nodes from one or more networks, such as DeepWalk [58], node2vec [59], etc. These algorithms can abstract linkages in the network and assign features to each node. As multiple networks were built for each of the circRNAs, diseases and miRNAs, we selected the powerful network embedding algorithm, mashup [32], which is the only one network embedding algorithm that can tackle more than one networks at one time. Its brief description is described below.
Given m networks, denoted as N1,N2,⋯,Nm, the random walk with restart (RWR) algorithm [60,61] is applied on each network, where each node is selected one by one as a seed node. When the i-th node in Nj is set as the seed node, the RWR algorithm assigns a probability to each node in Nj. The vector collecting these probabilities is picked up as the raw feature vector of such node, denoted by Vij. Such raw feature vectors always have large dimensions and multiple vectors can be obtained for the same node, which are derived from different networks. Thus, the following procedures from mashup fuse different feature vectors for the same node into a unified vector and reduce the dimension. Let Xi be the unified feature vector of the i-th node in all networks and Wij be the context feature vector of the i-th node in the j-th network. Based on Xi and Wij, the vector for the i-th node in the j-th network can be constructed and formulated by the following:
Aij=(Aij1,Aij2,⋯,Aijn)T | (4) |
where n is the total number of nodes and its k-th component Aijk(1≤k≤n) is defined by the following:
Aijk=exp((Xi)TWkj)∑k'exp((Xi)TWk'j) | (5) |
where Xi is the unified feature vector of the i-th node and Wkj is the context feature vector of the k-th node in the j-th network. Generally, Aij should be similar to Vij as much as possible so that the best values in Xi and Wij can be obtained. Thus, mashup solves the following optimization problem:
MinimizeXi,Wij1n∑mj=1∑ni=1DKL(Vij||Aij) | (6) |
where DKL represents the function of KL-divergence (relative entropy), and n and m are the number of nodes and networks, respectively.
The current study adopted the mashup program obtained from http://cb.csail.mit.edu/cb/mashup/, which was performed with its default parameters. This program was applied to the meta-path-induced networks of circRNAs, diseases, and miRNAs to generate features of circRNAs, diseases, miRNAs, respectively. Particularly, for the feature vectors derived from meta-path-induced networks for miRNAs, we only picked up the feature vectors of miRNAs and discarded the features for circRNAs.
The features of circRNAs, diseases, miRNAs were yielded by mashup. For CDAs in DS, the features of circRNAs and diseases were combined, whereas the features of miRNAs and diseases were aggregated for mDAs in DSm. The feature vectors of CDAs and mDAs, along with their labels (1 for positive and 0 for negative), were collected as the underlying dataset. A classification algorithm was necessary to build the binary classification model on the above dataset. In this study, six classification algorithms were attempted, including support vector machine (SVM) [62], logistic regression (LR) [63], random forest (RF) [64], AdaBoost and random forest as base classifier (ABRF) [65], XGB [33] and multilayer perceptron (MP) [66]. Based on different classification algorithms, the models were tested by a cross-validation method [67], thereby building the optimal classification model. The corresponding Python packages in scikit-learn [68] were adopted to implement the aforementioned six algorithms.
In this study, we adopted a five-fold cross-validation [67] to evaluate the performance of all models. In the original five-fold cross-validation, samples are divided into five parts. Each part was selected as a test dataset one by one, whereas the remaining parts constituted the training dataset. The model built on the training dataset is applied to the test dataset. Finally, each sample is tested exact once. This study slightly changed this procedure as mDAs in DSm were used to improve the prediction quality and were always poured into the training dataset. In detail, CDAs in DS were randomly and equally divided into five parts. Each part was singled out as test dataset one by one, whereas the rest four parts and DSm comprised the training dataset. In this case, only the CDAs in DS were tested.
The predicted results of a binary classification model can be counted as four entries: true positive (TP), false positive (FP), true negative (TN) and false negative (FN). Based on these entries, some measurements can be computed. This study adopted sensitivity (SN) (same as recall), specificity (SP), precision, accuracy (ACC) and F1-measure [69,70,71,72,73,74], which can be calculated by the following:
{SN=TPTP+FNSP=TNTN+FPPrecision=TPTP+FPACC=TP+TNTP+TN+FP+FNF1−measure=2×Precision×RecallPrecision+Recall | (7) |
The above measurements only evaluate the performance under a fixed threshold for determining the positive sample, which is generally set to 0.5. A receiver operating characteristic (ROC) curve can reflect the performance of models under various thresholds. By setting different values of the threshold, a group of SN and SP can be obtained. The ROC curve sets SN as the Y-axis and 1-SP as the X-axis in a coordinate system. The area under the curve (AUC) is a key measurement to assess the performance of the models. In general, the higher the AUC, the higher the performance. This study adopted AUC as the major measurement.
This study designed a new computational model for the identification of CDAs, named PCDA-HNMP. The entire construction procedures are illustrated in Figure 1. This section performed tests on this model and proved its superiority. To execute the tests, the mashup [32] program retrieved from http://cb.csail.mit.edu/cb/mashup/ and Python packages of six classification algorithms downloaded from scikit-learn [68] were adopted. All codes used in this study are available at https://github.com/Zxy-zxy0/PCDA-HNMP.git.
The proposed model, PCDA-HNMP, extracted features from meta-path-induced networks. In this study, six classification algorithms were attempted to discover a proper classification algorithm for tackling these features, as mentioned in Section 3.6. The main parameters of these algorithms were tuned, and the final optimal parameters were obtained and provided in https://github.com/Zxy-zxy0/PCDA-HNMP/tree/master/Code/classifier. As for the feature dimension yielded by mashup, we also tried various values, including 10, 20, 30, 40 and 50. All models with different parameters and feature dimensions were evaluated by a five-fold cross-validation. Generally, the feature dimension may influence the performance of the model. Figure 2 shows the performance of the models with six different classification algorithms under various dimensions, which was measured by AUC. It can be observed that the influence of the feature dimension for different classification algorithms was different. SVM, LR and MP were influenced by a feature dimension more than the other three algorithms and their performance was evidently lower than ABRF, XGB and RF. The performance of models with ABRF, XGB and RF under different dimensions was almost at the same level. After simple comparisons, we can obtain the optimal feature dimension for each classification algorithm. The optimal feature dimensions for MP, SVM and LR were all 50, whereas such dimensions for XGB, ABRF and RF were all 20. The detailed performance for different classification algorithms (under their optimal feature dimensions) is listed in Table 3. Evidently, the model with XGB provided the best ACC, F1-measure, and SN, and the highest precision and SP was accessed by the models with either ABRF or RF, respectively. The ROC curves of these six models are illustrated in Figure 3. It can be observed that the model with XGB yielded the highest AUC of 98.46%, followed by the models with ABRF, RF, MP, SVM and LR. From these results, we can conclude that the model with XGB yielded the highest performance. Accordingly, the PCDA-HNMP adopted XGB as the classification algorithm and the feature dimension was set to 20.
Classification algorithm (feature dimension) | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
XGB (20) | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
ABRF (20) | 93.79 | 93.66 | 95.51 | 91.95 | 95.62 |
RF (20) | 93.00 | 92.80 | 95.45 | 90.38 | 95.62 |
MP (50) | 89.40 | 89.65 | 87.75 | 91.78 | 87.03 |
SVM (50) | 67.86 | 68.43 | 67.22 | 69.71 | 66.02 |
LR (50) | 64.01 | 63.91 | 64.14 | 63.75 | 64.28 |
PCDA-HNMP was designed for the identification of CDAs. From its construction procedures, the information of mDAs was employed when training the model. The purpose was to improve the training quality, thereby enhancing the performance of PCDA-HNMP. This section provides the evidence to prove that the employment of mDAs was helpful.
By removing the mDAs, we built another model. Its parameter was similar to those in PCDA-HNMP. Additionally, this model was evaluated by a five-fold cross-validation. The evaluation results are listed in Table 4. For easy comparisons, the results for PCDA-HNMP are also provided in this table (last row of Table 4). It can be observed that all measurements of PCDA-HNMP were higher than those of the model removing mDAs. For example, PCDA-HNMP improved the AUC by 0.15%, and the ACC and F1-measure were 1.32% and 1.26% higher than those of the model removing mDAs, respectively. Based on these results, the employment of mDAs can really improve the performance of PCDA-HNMP. Since there are close relationship between circRNAs and miRNAs, the existing mDAs can help predict CDAs. For example, if one miRNA is associated with one disease, the circRNA that has close relationship with this miRNA may also be associated with this disease. This is an important reason as to why the employment of mDAs can enhance the performance of PCDA-HNMP.
Addition of mDAs | Addition of meta-path-induced networks | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) | AUC (%) |
⊆ | √ | 93.08 | 93.13 | 92.48 | 93.88 | 92.30 | 98.31 |
√ | ⊆ | 91.24 | 91.16 | 92.17 | 90.20 | 92.29 | 97.52 |
√ | √ | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 | 98.46 |
In PCDA-HNMP, a number of meta-path-induced networks were constructed to generate the features of circRNAs, diseases and miRNAs. To indicate the importance of these networks, some tests were conducted.
The first test was to prove the necessity of meta-path-induced networks. In fact, from the heterogeneous network HN, the features of circRNAs, diseases and miRNAs can be directly obtained via mashup. Based on these features, the model for the identification of CDAs can be built. Additionally, this model was evaluated by a five-fold cross-validation. The results are listed in Table 4. The ACC, F1-measure, precision, SN, SP and AUC were 91.24%, 91.16%, 92.17%, 90.20%, 92.29% and 97.52%, respectively. Compared with the corresponding measurements of PCDA-HNMP (last row of Table 4), each measurement was reduced. In detail, the AUC decreased about 1%, whereas others declined greater than 2%. These results proved that meta-path-induced networks can further mine the relationships between circRNAs, diseases and miRNAs, thereby generating more informative features to enhance the performance of PCDA-HNMP. The employment of meta-path-induced networks was a good and effective choice.
The second test further examined which meta-path-induced networks were more important (i.e., which networks provided more contributions for building PCDA-HNMP). To this end, each meta-path-induced network was removed one by one. We used rest networks to produce features and then built the model. The performance of each model is listed in Table 5 and its ROC curve, along with AUC, is provided in Figure 4. It can be found that the performance of these models were all lower than PCDA-HNMP. The removal of each meta-path-induced network reduced each measurement listed in Table 5. As for the AUC values, they also decreased compared with that of PCDA-HNMP. Each network gave positive contributions to PCDA-HNMP. Furthermore, the decline degree was not same when different networks were removed. Evidently, when the network X3 was removed, the decline degree reached a maximum. The ACC, F1-measure and AUC dropped to 66.11%, 65.41% and 72.44%, respectively. The decline degree exceeded 25%. This result indicated that X3 was most important for constructing PCDA-HNMP. The network X3 was derived from meta-path-3 (circRNA-disease-circRNA). It was reasonable that the removal of currently known CDAs can provide the greatest influence to the model. As for other networks, they almost provide similar contributions. Relative speaking, networks X8 and X12, which were induced by meta-path-8 (disease-circRNA-disease) and meta-path-12 (circRNA-disease-miRNA), were more important than others. These networks were all related to associations between circRNAs and diseases. Therefore, this result was also reasonable.
Meta-path-induced network | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
X1 | 92.21 | 92.25 | 91.88 | 92.65 | 91.77 |
X2 | 92.91 | 92.94 | 92.41 | 93.52 | 92.30 |
X3 | 66.11 | 65.41 | 66.84 | 64.10 | 68.13 |
X4 | 94.05 | 94.06 | 93.75 | 94.40 | 93.70 |
X5 | 92.30 | 92.31 | 92.16 | 92.47 | 92.12 |
X6 | 91.59 | 91.63 | 91.34 | 91.94 | 91.24 |
X7 | 92.21 | 92.23 | 92.00 | 92.47 | 91.95 |
X8 | 91.33 | 91.42 | 90.58 | 92.30 | 90.37 |
X9 | 91.42 | 91.48 | 90.87 | 92.13 | 90.72 |
X10 | 92.56 | 92.58 | 92.34 | 92.82 | 92.29 |
X11 | 91.42 | 91.53 | 90.44 | 92.65 | 90.20 |
X12 | 91.07 | 91.12 | 90.67 | 91.59 | 90.55 |
To date, several models have been designed for the identification of CDAs. To prove the superiority of PCDA-HNMP, we compared it with some existing state-of-the-art models, including DMFCDA [29], GCNCDA [26], CircWalk [25] and SIMCCDA [31]. Moreover, their performance was evaluated by a five-fold cross-validation. The obtained measurements, including the ACC, F1-measure, precision, SN and SP, are listed in Table 6. For easy comparisons, these measurements yielded by PCDA-HNMP are also provided in this table. Clearly, PCDA-HNMP provided the highest performance amongst all measurements. CircWalk gave the second highest performance. Each measurement of CircWalk was about 2.5% lower than that of PCDA-HNMP. As for the other three models (DMFCDA, GCNCDA and SIMCCDA), their performance was much lower than PCDA-HNMP. In detail, PCDA-HNMP yielded close to or more than 10% on each measurement. These results indicate that PCDA-HNMP evidently outperforms the above state-of-the-art models.
Model | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
PCDA-HNMP | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
CircWalk [25]$ | 92.09 | 92.08 | 92.36 | 91.83 | 92.35 |
DMFCDA [29]$ | 83.69 | 83.69 | 81.55 | 87.79 | 79.60 |
GCNCDA [26]$ | 74.52 | 74.90 | 73.79 | 76.17 | 72.87 |
SIMCCDA [31]$ | 83.36 | 16.40 | 9.10 | 84.54 | 83.34 |
$: The performance of these models was directly obtained from [25]. |
Additionally, we compared the ROC curves of PCDA-HNMP and the above four models, as illustrated in Figure 5. Similar to the results listed in Table 6, PCDA-HNMP yielded the highest AUC, which was higher than 98%. Circwalk still provided the second highest AUC, which was 97.77%. The AUC values of other three models were quite low, lower than 90%. Based on these results, we can further conclude that PCDA-HNMP was better than these existing models for the identification of CDAs.
According to the construction procedures of DMFCDA, GCNCDA and SIMCCDA, they employed limited sources. The objects of these models were only circRNAs and diseases. Few sources induced that they cannot completely describe circRNAs and diseases. This was the main reason why they were much more inferior to our model (PCDA-HNMP) and CircWalk. As for CircWalk, it employs miRNAs and mRNAs alongside circRNAs and diseases. However, it directly extracted circRNA and disease features from the heterogeneous network using DeepWalk [58]. Our model (PCDA-HNMP) gave a deep overview on the heterogeneous network and deeply mined the relationships between the circRNAs and diseases via meta-paths. Then, a more powerful network embedding algorithm, mashup, was adopted to extract features of circRNAs and diseases from the above-obtained relationships. Thus, the features used in PCDA-HNMP were more informative than those used in CircWalk. In addition, when training PCDA-HNMP, the known mDAs were employed. Since a close relationship exists between circRNAs and miRNAs, mDAs can help predict CDAs, thereby further improving the performance of PCDA-HNMP. Thus, it was logical that PCDA-HNMP outperformed CircWalk.
In this study, we proposed a new model, PCDA-HNMP, for the identification of CADs. From the results listed in Section 4, it was found that the PCDA-HNMP had a high performance, with an AUC of 98.46%. The comparison results in Section 4.4 indicated that the model was better than four alternative methods. Compared with these methods, our model has three major advantages. The first advantage was the use of more related information of circRNAs and diseases. The heterogeneous network in PCDA-HNMP contained miRNAs and mRNAs alongside circRNAs and diseases. Several studies have been reported that miRNAs and mRNAs were highly related to circRNAs or diseases [40,41,43,45,75]. Their additions can enhance the changes to discover more hidden associations between circRNAs and diseases. Many alternative methods employed limited objects and were only used to measure the similarity between circRNAs and diseases (i.e., they were indirectly used in the methods). In our model, miRNAs and mRNAs were directly listed in the heterogeneous network, suggesting they can play more important roles in constructing the model. As for the second advantage, we employed the meta-paths to mine the hidden associations between circRNAs and diseases or circRNAs and miRNAs. The heterogeneous network was built based on the current knowledge on circRNAs, diseases, miRNAs and mRNAs. Evidently, such knowledge is far from complete. The employment of a meta-path can infer such unknown knowledge, thereby improving the model's performance, which has been proven in Section 4.3. One alternative method, CircWalk, adopted the same heterogeneous network but not apply a meta-path to mine hidden associations between circRNAs and diseases or circRNAs and miRNAs, thereby inducing a decreased performance compared to PCDA-HNMP. The last advantage is the employment of mDAs when training the model. Fusing the information of related objects was a common way to construct more efficient prediction models. However, most previous methods adopted these objects to obtain additional information of the main objects. For example, the miRNAs can be used to measure the associations between circRNAs or diseases. In general, they did not participate in the model training procedures. In our opinion, the validated mDAs were helpful to find out novel CDAs since miRNAs and circRNAs had close relationships, and the similar miRNAs and circRNAs may be related to the same disease, which can enhance the model's performance. The test results in Section 4.2 confirmed the above facts.
Besides the advantages, our model also had some disadvantages or limitations. The first disadvantage was the use of mDAs. Although the employment of mDAs can improve the model's performance, the improvement degree was limited (less than 0.2% on AUC). As miRNAs had the same roles to circRNAs when training the model, developing a method to extract miRNA features that were in the same space of the circRNA features was a challenging problem. In our model, we generated miRNA features from the bipartite networks of miRNAs and circRNAs. It was not clear whether this method was optimal. In our future work, we will design a more reasonable way to obtain miRNA features. We can potentially use the idea of graph convolution network to transfer the circRNA features to miRNAs so that circRNA and miRNA features were in the same space. PCDA-HNMP was a traditional machine learning model, which induced the second limitation. The feature extraction and classification procedures were completely separated. This meant that the features of CDAs and mDAs were not very special for the identification of CDAs. In deep learning, the end-to-end scheme gave us a new direction to build more powerful prediction models. In the future, we will fuse deep learning algorithms into our model to further enhance model's performance.
This study proposed a new model, PCDA-HNMP, for predicting circRNA-disease associations. To access informative features of circRNAs and diseases, a heterogeneous network was constructed and networks for circRNAs and diseases were built in terms of meta-paths extracted from the heterogeneous network. The test results indicated that the addition of meta-path-induced networks can really improve model's performance. Furthermore, we employed miRNA-disease associations when training the model, which can also improve the prediction quality. The superiority of PCDA-HNMP compared with some previous models indicated that it can be a useful tool to identify circRNA-disease associations. The codes and related data are available at https://github.com/Zxy-zxy0/PCDA-HNMP.git.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
The authors declare there is no conflict of interest.
[1] |
Ren X, Li J, He F, et al. (2023) Impact of climate policy uncertainty on traditional energy and green markets: Evidence from time-varying granger tests. Renew Sust Energ Rev 173: 113058. https://doi.org/10.1016/j.rser.2022.113058 doi: 10.1016/j.rser.2022.113058
![]() |
[2] |
Abdel-Fattah A, Hamdan S, Ayadi O, et al. (2022) Energy efficiency and thermal insulation code violations for residential buildings in Jordan and the role of social advocacy campaigns. Energ Sustain Dev 71: 419–432. https://doi.org/10.1016/j.esd.2022.10.013 doi: 10.1016/j.esd.2022.10.013
![]() |
[3] |
Qadourah J A (2022) Energy and economic potential for photovoltaic systems installed on the rooftop of apartment buildings in Jordan. Results Eng 16: 100642. https://doi.org/10.1016/j.rineng.2022.100642 doi: 10.1016/j.rineng.2022.100642
![]() |
[4] |
Abu Qadourah J, AL-Falahat A M, Alrwashdeh S S (2022) Alrwashdeh, Assessment of solar photovoltaics potential installation into multi-family building's envelope in Amman, Jordan. Cogent Eng 9 https://doi.org/10.1080/23311916.2022.2082059 doi: 10.1080/23311916.2022.2082059
![]() |
[5] |
Alrwashdeh S S (2019) Investigation of Wind Energy Production at Different Sites in Jordan Using the Site Effectiveness Method. Energ Eng J Assoc Energ Eng 116: 47–59. https://doi.org/10.1080/01998595.2019.12043338 doi: 10.1080/01998595.2019.12043338
![]() |
[6] | Alrwashdeh S S, Alsaraireh F M (2018) Wind energy production assessment at different sites in Jordan using probability distribution functions. ARPN J Eng Appl Sci 13: 8163–8172. |
[7] | Juaidi A, Abdallah R, Ayadi O, et al. (2022) Chapter 7 - Wind energy in Jordan and Palestine: current status and future perspectives, in Renewable Energy Production and Distribution, M. Jeguirim, Editor. Academic Press. 229–269. https://doi.org/10.1016/B978-0-323-91892-3.00006-6 |
[8] |
Abu-Rumman G, Khdair A I, Khdair S I (2020) Current status and future investment potential in renewable energy in Jordan: An overview. Heliyon 6: e03346. https://doi.org/10.1016/j.heliyon.2020.e03346 doi: 10.1016/j.heliyon.2020.e03346
![]() |
[9] |
Al Afif R, Ayed Y, Maaitah O N (2023) Feasibility and optimal sizing analysis of hybrid renewable energy systems: A case study of Al-Karak, Jordan. Renew Energy https://doi.org/10.1016/j.renene.2022.12.109 doi: 10.1016/j.renene.2022.12.109
![]() |
[10] |
Alawin A A, Rahmeh T A, Jaber J O, et al. (2016) Renewable energy education in engineering schools in Jordan: Existing courses and level of awareness of senior students. Renew Sust Energ Rev 65: 308–318. https://doi.org/10.1016/j.rser.2016.07.003 doi: 10.1016/j.rser.2016.07.003
![]() |
[11] |
Alawneh R, Ghazali F, Ali H, et al. (2019) A Novel framework for integrating United Nations Sustainable Development Goals into sustainable non-residential building assessment and management in Jordan. Sustain Cities Soc 49: 101612. https://doi.org/10.1016/j.scs.2019.101612 doi: 10.1016/j.scs.2019.101612
![]() |
[12] |
Albatayneh A, Hindiyeh M, AlAmawi R (2022) Potential of renewable energy in water-energy-food nexus in Jordan. Energ Nexus 7: 100140. https://doi.org/10.1016/j.nexus.2022.100140 doi: 10.1016/j.nexus.2022.100140
![]() |
[13] |
Qadourah J A, Alrwashdeh S (2022) ECONOMIC FEASIBILITY OF HEATING SOURCE CONVERSION OF THE SWIMMING POOLS. J Appl Eng Sci 20: 230–238. https://doi.org/10.5937/jaes0-34474 doi: 10.5937/jaes0-34474
![]() |
[14] | Al-Falahat A M, Kardjilov N, Khanh T V, et al. (2019) Energy-selective neutron imaging by exploiting wavelength gradients of double crystal monochromators—Simulations and experiments. Nuclear Instruments and Methods in Physics Research, Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 943. https://doi.org/10.1016/j.nima.2019.162477 |
[15] |
Ala'a M, Qadourah J A, Alrwashdeh S S, et al. (2022) Energy performance and economics assessments of a photovoltaic-heat pump system. Results Eng 13. https://doi.org/10.1016/j.rineng.2021.100324 doi: 10.1016/j.rineng.2021.100324
![]() |
[16] |
Ali H, Hashlamun R (2019) Envelope retrofitting strategies for public school buildings in Jordan. J Buiild Eng 25: 100819. https://doi.org/10.1016/j.jobe.2019.100819 doi: 10.1016/j.jobe.2019.100819
![]() |
[17] |
Alkhalidi A, Alqarra K, Abdelkareem M A, et al. (2022) Renewable energy curtailment practices in Jordan and proposed solutions. Int J Thermofluids 16: 100196. https://doi.org/10.1016/j.ijft.2022.100196 doi: 10.1016/j.ijft.2022.100196
![]() |
[18] |
Almomani E, Hammad E A, AlQutob R, et al. (2022) Capacity Building for Health Technology Assessment in Jordan: Institutionalization and Its Use in Pricing and Reimbursement Decisions. Value Health Reg Iss 32: 47–53. https://doi.org/10.1016/j.vhri.2022.07.006 doi: 10.1016/j.vhri.2022.07.006
![]() |
[19] | Alrwashdeh S S (2017) Determining the optimum tilt solar angle of a PV applications at different sites in Jordan. J Eng Appl Sci 12: 9295–9303. |
[20] | Alrwashdeh S S (2018)Predicting of energy production of solar tower based on the study of the cosine efficiency and the field layout of heliostats. Int J Mech Eng Technol 9: 250–257. |
[21] | Alrwashdeh S S (2018) Investigation of the energy output from PV racks based on using different tracking systems in Amman-Jordan. Int J Mech Eng Technol 9: 687–694. |
[22] | Alrwashdeh S S (2018) Comparison among solar panel arrays production with a different operating temperatures in Amman-Jordan. Int J Mech Eng Technol 9: 420–429. |
[23] |
Bartmiński P, Siłuch M (2022) Mapping the albedo of the active surface at different stages of the growing season using data from various sources. Remote Sens Appl Soc Environ 28: 100818. https://doi.org/10.1016/j.rsase.2022.100818 doi: 10.1016/j.rsase.2022.100818
![]() |
[24] |
Cierniewski J, Ceglarek J (2023) Influence of arable land location incorporating its roughness on blue-sky albedo variation. J Quantit Spectrosc Radiat Transf 296: 108440. https://doi.org/10.1016/j.jqsrt.2022.108440 doi: 10.1016/j.jqsrt.2022.108440
![]() |
[25] |
Du J, Zhou H, Jacinthe P A, et al. (2023) Retrieval of lake water surface albedo from Sentinel-2 remote sensing imagery. J Hydrol 617: 128904. https://doi.org/10.1016/j.jhydrol.2022.128904 doi: 10.1016/j.jhydrol.2022.128904
![]() |
[26] |
Lin X, Wu S, Chen B, et al. (2022) Estimating 10-m land surface albedo from Sentinel-2 satellite observations using a direct estimation approach with Google Earth Engine. ISPRS J Photogramm Remote Sens 194: 1–20. https://doi.org/10.1016/j.isprsjprs.2022.09.016 doi: 10.1016/j.isprsjprs.2022.09.016
![]() |
[27] |
Xiao Y, Ke C Q, Shen X, et al. (2023) What drives the decrease of glacier surface albedo in High Mountain Asia in the past two decades? Sci Total Environ 863: 160945. https://doi.org/10.1016/j.scitotenv.2022.160945 doi: 10.1016/j.scitotenv.2022.160945
![]() |
[28] | Alrwashdeh S S (2018) Energy production evaluation from a linear fresnel reflectors arrays with different array orientation. Inter J Engin Res Technol 11: 1811–1819. |
[29] | Alrwashdeh S S (2018) Assessment of the energy production from PV racks based on using different solar canopy form factors in Amman-Jordan. Inter J Engin Res Technol 11: 1595–1603. |
[30] |
Alrwashdeh S.S (2018) The effect of solar tower height on its energy output at Ma'an-Jordan. AIMS Energ. 6: 959–966. https://doi.org/10.3934/energy.2018.6.959 doi: 10.3934/energy.2018.6.959
![]() |
[31] | Alrwashdeh S S (2018) Assessment of photovoltaic energy production at different locations in Jordan. Int J Renew Energ Res 8: 797–804. |
[32] | Alrwashdeh S S (2019) An energy production evaluation from PV arrays with different inter-row distances. Int J Mech Prod Eng Res Develop 9: 1–10. |
[33] |
Alrwashdeh S S (2019) Energy production assessment of solar tower based on the study of the mirror shadowing and blocking effects. Univers J Mech Eng 7: 71–76. ttps://doi.org/10.13189/ujme.2019.070205 doi: 10.13189/ujme.2019.070205
![]() |
[34] |
Jin Y, Liu K, Xiong L, et al. (2022) A non-traditional variant nonlinear energy sink for vibration suppression and energy harvesting. Mech Syst Signal Process 181: 109479. https://doi.org/10.1016/j.ymssp.2022.109479 doi: 10.1016/j.ymssp.2022.109479
![]() |
[35] |
Kiwan S, Al-Gharibeh E (2020) Jordan toward a 100% renewable electricity system. Ren Energ 147: 423–436. https://doi.org/10.1016/j.renene.2019.09.004 doi: 10.1016/j.renene.2019.09.004
![]() |
[36] |
Ma'bdeh S N, Ali H H, Rabab'ah I O (2022) Sustainable assessment of using green roofs in hot-arid areas – Residential buildings in Jordan. J Buiild Eng 45: 103559. https://doi.org/10.1016/j.jobe.2021.103559 doi: 10.1016/j.jobe.2021.103559
![]() |
[37] |
Alrwashdeh S S (2021) Investigation of the energy output from PV panels based on using different orientation systems in Amman-Jordan. Case Stud Therm Eng 28. https://doi.org/10.1016/j.csite.2021.101580 doi: 10.1016/j.csite.2021.101580
![]() |
[38] |
Alrwashdeh S S (2021) Investigation of the energy output from PV panels based on using different orientation systems in Amman-Jordancorlowast. Case Stud Therm Eng 28: 101580. https://doi.org/10.1016/j.csite.2021.101580 doi: 10.1016/j.csite.2021.101580
![]() |
[39] | Alrwashdeh S S (2022) INVESTIGATION OF THE ENERGY OUTPUT OF PARABOLIC TROUGH RACKS BASED ON USING DIFFERENT RHOMBOID LAYOUT. ARPN J J Eng Appl Sci 17: 578–586. |
[40] |
Alrwashdeh S S, Ammari H (2019) Life cycle cost analysis of two different refrigeration systems powered by solar energy. Case Stud Therm Eng 16. https://doi.org/10.1016/j.csite.2019.100559 doi: 10.1016/j.csite.2019.100559
![]() |
[41] |
Alrwashdeh S S, Ammari H, Jweihan Y S, et al. (2022) Refurbishment of Existing Building toward a Surplus Energy Building in Jordan. Open Const Build Technol J 16 https://doi.org/10.2174/18748368-v16-e2208150 doi: 10.2174/18748368-v16-e2208150
![]() |
[42] |
Alrwashdeh S S, Qadourah J A, Al-Falahat A M (2022) Investigation of the Effect of Roof Color on the Energy Use of a Selected House in Amman, Jordan. Front Mech Eng 8. https://doi.org/10.3389/fmech.2022.897170 doi: 10.3389/fmech.2022.897170
![]() |
[43] | Alsarayreh A A, Al-Obaidi M A, Alrwashdeh S S, et al. (2022) Enhancement of energy saving of reverse osmosis system via incorporating a photovoltaic system, in Computer Aided Chemical Engineering. 697–702. https://doi.org/10.1016/B978-0-323-95879-0.50117-X |
[44] |
Altarawneh O R, Alsarayreh A A, Ala'a M, et al (2022) Energy and exergy analyses for a combined cycle power plant in Jordan. Case Stud Therm Eng 31. https://doi.org/10.1016/j.csite.2022.101852 doi: 10.1016/j.csite.2022.101852
![]() |
[45] |
Pastor R, Tobarra L, Robles-Gómez A, et al. (2020) Renewable energy remote online laboratories in Jordan universities: Tools for training students in Jordan. Renew Energ 149: 749–759. https://doi.org/10.1016/j.renene.2019.12.100 doi: 10.1016/j.renene.2019.12.100
![]() |
[46] |
Chenoweth J, Al-Masri R A (2021) The impact of adopting a water-energy nexus approach in Jordan on transboundary management. Environ Sci Policy 118: 49–55. https://doi.org/10.1016/j.envsci.2021.01.006 doi: 10.1016/j.envsci.2021.01.006
![]() |
[47] |
Murtadha T K, dil Hussein A A, Alalwany A A H, et al. (2022)Improving the cooling performance of photovoltaic panels by using two passes circulation of titanium dioxide nanofluid. Case Stud Therm Eng. 36. https://doi.org/10.1016/j.csite.2022.102191 doi: 10.1016/j.csite.2022.102191
![]() |
[48] |
Schuetze B, Hussein H (2023) The geopolitical economy of an undermined energy transition: The case of Jordan. Energ Policy 180: 113655. https://doi.org/10.1016/j.enpol.2023.113655 doi: 10.1016/j.enpol.2023.113655
![]() |
[49] |
Alrwashdeh S S (2022) Energy sources assessment in Jordan. Results Eng 13. https://doi.org/10.1016/j.rineng.2021.100329 doi: 10.1016/j.rineng.2021.100329
![]() |
1. | Qinglan Ma, Yulong Shen, Wei Guo, Kaiyan Feng, Tao Huang, Yudong Cai, Machine Learning Reveals Impacts of Smoking on Gene Profiles of Different Cell Types in Lung, 2024, 14, 2075-1729, 502, 10.3390/life14040502 | |
2. | Wenwei Wu, Tao He, Xiaying Hao, Kaiwei Xu, Ji Zeng, Jiahui Gu, Lei Chen, Zhengmao Li, Machine learning based method for analyzing vibration and noise in large cruise ships, 2024, 19, 1932-6203, e0307835, 10.1371/journal.pone.0307835 | |
3. | Qinglan Ma, Yu-Hang Zhang, Wei Guo, Kaiyan Feng, Tao Huang, Yu-Dong Cai, Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types, 2024, 14, 2075-1729, 1032, 10.3390/life14081032 | |
4. | Feiming Huang, Qian Gao, XianChao Zhou, Wei Guo, KaiYan Feng, Lin Zhu, Tao Huang, Yu-Dong Cai, Prediction of Solubility of Proteins in Escherichia coli Based on Functional and Structural Features Using Machine Learning Methods, 2024, 43, 1572-3887, 983, 10.1007/s10930-024-10230-z | |
5. | Bo Zhou, Bing Ran, Lei Chen, A GraphSAGE-based model with fingerprints only to predict drug-drug interactions, 2024, 21, 1551-0018, 2922, 10.3934/mbe.2024130 | |
6. | Anhui Yin, Lei Chen, Bo Zhou, Yu-Dong Cai, CMAGN: circRNA–miRNA association prediction based on graph attention auto-encoder and network consistency projection, 2024, 25, 1471-2105, 10.1186/s12859-024-05959-4 | |
7. | QingLan Ma, Mei Meng, XianChao Zhou, Wei Guo, KaiYan Feng, Tao Huang, Yu‐Dong Cai, Identification of Key Genes in Fetal Gut Development at Single‐Cell Level by Exploiting Machine Learning Techniques, 2024, 1615-9853, 10.1002/pmic.202400104 | |
8. | YuSheng Bao, QingLan Ma, Lei Chen, KaiYan Feng, Wei Guo, Tao Huang, Yu-Dong Cai, Recognizing SARS-CoV-2 infection of nasopharyngeal tissue at the single-cell level by machine learning method, 2025, 177, 01615890, 44, 10.1016/j.molimm.2024.12.004 | |
9. | Lei Chen, Shiyi Zhang, Bo Zhou, Herb-disease association prediction model based on network consistency projection, 2025, 15, 2045-2322, 10.1038/s41598-025-87521-7 | |
10. | QingLan Ma, Jingxin Ren, Lei Chen, Wei Guo, KaiYan Feng, Tao Huang, Yu-Dong Cai, Identifying Key Clinical Indicators Associated with the Risk of Death in Hospitalized COVID-19 Patients, 2025, 20, 15748936, 359, 10.2174/0115748936306893240720192301 |
Symbol | Descriptions |
Acd | Association matrix between circRNAs and diseases |
Adi | Association matrix between diseases and miRNAs |
Acc | Association matrix between circRNAs |
Aci | Association matrix between circRNAs and miRNAs |
Aim | Association matrix between miRNAs and mRNAs |
Amd | Association matrix between mRNAs and diseases |
Add | Association matrix between diseases |
Vij | Raw feature vector of the i-th node in the j-th network yielded by RWR algorithm |
Xi | Final feature vector of the i-th node |
Wij | Context feature vector of the i-th node in the j-th network |
Aij | Reconstructed feature vector of the i-th node in the j-th network |
Aijk | The k-th component of the reconstructed feature vector of the i-th node in the j-th network |
Type | Entry | Number |
Nodes | circRNA | 474 |
Disease | 61 | |
miRNA | 632 | |
mRNA | 1365 | |
Association | circRNA←→ disease | 571 |
circRNA←→circRNA | 46 | |
circRNA←→miRNA | 1313 | |
miRNA←→disease | 912 | |
miRNA←→mRNA | 1194 | |
mRNA←→disease | 1931 | |
disease←→disease | 162 |
Classification algorithm (feature dimension) | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
XGB (20) | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
ABRF (20) | 93.79 | 93.66 | 95.51 | 91.95 | 95.62 |
RF (20) | 93.00 | 92.80 | 95.45 | 90.38 | 95.62 |
MP (50) | 89.40 | 89.65 | 87.75 | 91.78 | 87.03 |
SVM (50) | 67.86 | 68.43 | 67.22 | 69.71 | 66.02 |
LR (50) | 64.01 | 63.91 | 64.14 | 63.75 | 64.28 |
Addition of mDAs | Addition of meta-path-induced networks | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) | AUC (%) |
⊆ | √ | 93.08 | 93.13 | 92.48 | 93.88 | 92.30 | 98.31 |
√ | ⊆ | 91.24 | 91.16 | 92.17 | 90.20 | 92.29 | 97.52 |
√ | √ | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 | 98.46 |
Meta-path-induced network | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
X1 | 92.21 | 92.25 | 91.88 | 92.65 | 91.77 |
X2 | 92.91 | 92.94 | 92.41 | 93.52 | 92.30 |
X3 | 66.11 | 65.41 | 66.84 | 64.10 | 68.13 |
X4 | 94.05 | 94.06 | 93.75 | 94.40 | 93.70 |
X5 | 92.30 | 92.31 | 92.16 | 92.47 | 92.12 |
X6 | 91.59 | 91.63 | 91.34 | 91.94 | 91.24 |
X7 | 92.21 | 92.23 | 92.00 | 92.47 | 91.95 |
X8 | 91.33 | 91.42 | 90.58 | 92.30 | 90.37 |
X9 | 91.42 | 91.48 | 90.87 | 92.13 | 90.72 |
X10 | 92.56 | 92.58 | 92.34 | 92.82 | 92.29 |
X11 | 91.42 | 91.53 | 90.44 | 92.65 | 90.20 |
X12 | 91.07 | 91.12 | 90.67 | 91.59 | 90.55 |
Model | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
PCDA-HNMP | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
CircWalk [25]$ | 92.09 | 92.08 | 92.36 | 91.83 | 92.35 |
DMFCDA [29]$ | 83.69 | 83.69 | 81.55 | 87.79 | 79.60 |
GCNCDA [26]$ | 74.52 | 74.90 | 73.79 | 76.17 | 72.87 |
SIMCCDA [31]$ | 83.36 | 16.40 | 9.10 | 84.54 | 83.34 |
$: The performance of these models was directly obtained from [25]. |
Symbol | Descriptions |
Acd | Association matrix between circRNAs and diseases |
Adi | Association matrix between diseases and miRNAs |
Acc | Association matrix between circRNAs |
Aci | Association matrix between circRNAs and miRNAs |
Aim | Association matrix between miRNAs and mRNAs |
Amd | Association matrix between mRNAs and diseases |
Add | Association matrix between diseases |
Vij | Raw feature vector of the i-th node in the j-th network yielded by RWR algorithm |
Xi | Final feature vector of the i-th node |
Wij | Context feature vector of the i-th node in the j-th network |
Aij | Reconstructed feature vector of the i-th node in the j-th network |
Aijk | The k-th component of the reconstructed feature vector of the i-th node in the j-th network |
Type | Entry | Number |
Nodes | circRNA | 474 |
Disease | 61 | |
miRNA | 632 | |
mRNA | 1365 | |
Association | circRNA←→ disease | 571 |
circRNA←→circRNA | 46 | |
circRNA←→miRNA | 1313 | |
miRNA←→disease | 912 | |
miRNA←→mRNA | 1194 | |
mRNA←→disease | 1931 | |
disease←→disease | 162 |
Classification algorithm (feature dimension) | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
XGB (20) | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
ABRF (20) | 93.79 | 93.66 | 95.51 | 91.95 | 95.62 |
RF (20) | 93.00 | 92.80 | 95.45 | 90.38 | 95.62 |
MP (50) | 89.40 | 89.65 | 87.75 | 91.78 | 87.03 |
SVM (50) | 67.86 | 68.43 | 67.22 | 69.71 | 66.02 |
LR (50) | 64.01 | 63.91 | 64.14 | 63.75 | 64.28 |
Addition of mDAs | Addition of meta-path-induced networks | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) | AUC (%) |
⊆ | √ | 93.08 | 93.13 | 92.48 | 93.88 | 92.30 | 98.31 |
√ | ⊆ | 91.24 | 91.16 | 92.17 | 90.20 | 92.29 | 97.52 |
√ | √ | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 | 98.46 |
Meta-path-induced network | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
X1 | 92.21 | 92.25 | 91.88 | 92.65 | 91.77 |
X2 | 92.91 | 92.94 | 92.41 | 93.52 | 92.30 |
X3 | 66.11 | 65.41 | 66.84 | 64.10 | 68.13 |
X4 | 94.05 | 94.06 | 93.75 | 94.40 | 93.70 |
X5 | 92.30 | 92.31 | 92.16 | 92.47 | 92.12 |
X6 | 91.59 | 91.63 | 91.34 | 91.94 | 91.24 |
X7 | 92.21 | 92.23 | 92.00 | 92.47 | 91.95 |
X8 | 91.33 | 91.42 | 90.58 | 92.30 | 90.37 |
X9 | 91.42 | 91.48 | 90.87 | 92.13 | 90.72 |
X10 | 92.56 | 92.58 | 92.34 | 92.82 | 92.29 |
X11 | 91.42 | 91.53 | 90.44 | 92.65 | 90.20 |
X12 | 91.07 | 91.12 | 90.67 | 91.59 | 90.55 |
Model | ACC (%) | F1-measure (%) | Precision (%) | SN (%) | SP (%) |
PCDA-HNMP | 94.40 | 94.39 | 94.42 | 94.40 | 94.40 |
CircWalk [25]$ | 92.09 | 92.08 | 92.36 | 91.83 | 92.35 |
DMFCDA [29]$ | 83.69 | 83.69 | 81.55 | 87.79 | 79.60 |
GCNCDA [26]$ | 74.52 | 74.90 | 73.79 | 76.17 | 72.87 |
SIMCCDA [31]$ | 83.36 | 16.40 | 9.10 | 84.54 | 83.34 |
$: The performance of these models was directly obtained from [25]. |