
Solar energy is one of the most important renewable energy sources due to its wide availability and applicability. One way to use this resource is by building-integrated photovoltaics (BIPV). Therefore, it is essential to develop a scientific map of BIPV systems and a comprehensive review of the scientific literature that identifies future research directions. For that reason, the bibliometric research methodology enables the quantification and evaluation of the performance, quality and influence of the generated maps and their elements. In this regard, an analysis of the scientific production related to BIPV, indexed from 2001 to 2022, was carried out using the Scopus database. This was done using a scientific mapping approach via the SciMAT tool to analyze the co-occurrence of terms through clustering techniques. The BIPV was integrated with the themes of buildings, investments, numerical models, office buildings, photovoltaic modules, roofs, solar cells and zero-energy buildings. As photovoltaic technology progresses, the production of flexible PV elements is increasing in lieu of silicon substrate-based PV elements, and this is of current scientific interest. The evaluations of BIPVs in various climatic contexts are encouraging in warm and sunny climates. BIPVs demonstrated high-energy generation, while in temperate climates, BIPV windows exhibited a reduction in heating and cooling loads, indicating notable efficiency. Despite significant benefits, BIPVs face challenges such as upfront costs, integration complexities and durability concerns. Therefore, silicon solar cells are considered a cross-cutting theme within the BIPV research field. It is highlighted that this study provides a comprehensive scientific mapping and critical review of the literature in the field of BIPV systems. This bibliometric analysis not only quantifies the performance and quality of the generated maps but also identifies key thematic areas that have evolved.
Citation: Eliseo Zarate-Perez, Juan Grados, Santiago Rubiños, Herbert Grados-Espinoza, Jacob Astocondor-Villar. Building-integrated photovoltaic (BIPV) systems: A science mapping approach[J]. AIMS Energy, 2023, 11(6): 1131-1152. doi: 10.3934/energy.2023052
[1] | Hui Xu, Jun Kong, Mengyao Liang, Hui Sun, Miao Qi . Video behavior recognition based on actional-structural graph convolution and temporal extension module. Electronic Research Archive, 2022, 30(11): 4157-4177. doi: 10.3934/era.2022210 |
[2] | Zhiming Cai, Liping Zhuang, Jin Chen, Jinhua Jiang . Lightweight high-performance pose recognition network: HR-LiteNet. Electronic Research Archive, 2024, 32(2): 1145-1159. doi: 10.3934/era.2024055 |
[3] | Longkui Jiang, Yuru Wang, Weijia Li . Regress 3D human pose from 2D skeleton with kinematics knowledge. Electronic Research Archive, 2023, 31(3): 1485-1497. doi: 10.3934/era.2023075 |
[4] | Chengyong Yang, Jie Wang, Shiwei Wei, Xiukang Yu . A feature fusion-based attention graph convolutional network for 3D classification and segmentation. Electronic Research Archive, 2023, 31(12): 7365-7384. doi: 10.3934/era.2023373 |
[5] | Yunfei Tan, Shuyu Li, Zehua Li . A privacy preserving recommendation and fraud detection method based on graph convolution. Electronic Research Archive, 2023, 31(12): 7559-7577. doi: 10.3934/era.2023382 |
[6] | Bingsheng Li, Na Li, Jianmin Ren, Xupeng Guo, Chao Liu, Hao Wang, Qingwu Li . Enhanced spectral attention and adaptive spatial learning guided network for hyperspectral and LiDAR classification. Electronic Research Archive, 2024, 32(7): 4218-4236. doi: 10.3934/era.2024190 |
[7] | Muhammad Ahmad Nawaz Ul Ghani, Kun She, Muhammad Usman Saeed, Naila Latif . Enhancing facial recognition accuracy through multi-scale feature fusion and spatial attention mechanisms. Electronic Research Archive, 2024, 32(4): 2267-2285. doi: 10.3934/era.2024103 |
[8] | Wenrui Guan, Xun Wang . A Dual-channel Progressive Graph Convolutional Network via subgraph sampling. Electronic Research Archive, 2024, 32(7): 4398-4415. doi: 10.3934/era.2024198 |
[9] | Xinzheng Xu, Xiaoyang Zhao, Meng Wei, Zhongnian Li . A comprehensive review of graph convolutional networks: approaches and applications. Electronic Research Archive, 2023, 31(7): 4185-4215. doi: 10.3934/era.2023213 |
[10] | Yongsheng Lei, Meng Ding, Tianliang Lu, Juhao Li, Dongyue Zhao, Fushi Chen . A novel approach for enhanced abnormal action recognition via coarse and precise detection stage. Electronic Research Archive, 2024, 32(2): 874-896. doi: 10.3934/era.2024042 |
Solar energy is one of the most important renewable energy sources due to its wide availability and applicability. One way to use this resource is by building-integrated photovoltaics (BIPV). Therefore, it is essential to develop a scientific map of BIPV systems and a comprehensive review of the scientific literature that identifies future research directions. For that reason, the bibliometric research methodology enables the quantification and evaluation of the performance, quality and influence of the generated maps and their elements. In this regard, an analysis of the scientific production related to BIPV, indexed from 2001 to 2022, was carried out using the Scopus database. This was done using a scientific mapping approach via the SciMAT tool to analyze the co-occurrence of terms through clustering techniques. The BIPV was integrated with the themes of buildings, investments, numerical models, office buildings, photovoltaic modules, roofs, solar cells and zero-energy buildings. As photovoltaic technology progresses, the production of flexible PV elements is increasing in lieu of silicon substrate-based PV elements, and this is of current scientific interest. The evaluations of BIPVs in various climatic contexts are encouraging in warm and sunny climates. BIPVs demonstrated high-energy generation, while in temperate climates, BIPV windows exhibited a reduction in heating and cooling loads, indicating notable efficiency. Despite significant benefits, BIPVs face challenges such as upfront costs, integration complexities and durability concerns. Therefore, silicon solar cells are considered a cross-cutting theme within the BIPV research field. It is highlighted that this study provides a comprehensive scientific mapping and critical review of the literature in the field of BIPV systems. This bibliometric analysis not only quantifies the performance and quality of the generated maps but also identifies key thematic areas that have evolved.
Human action recognition is one of the main research areas in the study of computer vision and machine learning, aiming to understand and assign a label to each action [1,2]. It appears in a wide range of applications such as video surveillance systems [3,4], human-computer interaction and robotics for human behavior characterization [5].
In general, most existing approaches in human action recognition fall into two main categories: RGB video based approaches [6,7,8] and skeleton-based methods [9,10,11,12,13,14,15] with respect to the types of input data. The RGB video based approaches are computationally expensive in processing RGB pixel information of videos, while the skeleton-based models are more efficient as they only need to calculate the 2D or 3D human joint point data. In addition, skeleton-based action recognition stands out due to its great potential in adaptability to dynamic circumstance and illumination variation [16,17,18,19,20].
It is well-known that extracting effective features for background separation and representation learning in skeleton-based action recognition is a challenging problem. The traditional deep learning-based methods are good at assembling skeletons as a set of independent joint features [21,22] or pseudo-images [23,24], i.e., representing skeleton data as vector sequences or two-dimensional vectors. With the constructed features, models and spatial-temporal correlations between them are given and learned. However, such vector-based representation cannot fully exploit the intrinsic relationships of human joints on account of ignoring the dependencies and inherent connections of the related ones. Thereupon, graph convolution networks (GCN) [25] have been used to capture inter-joint interactions, in which the natural connections of human joint points can be modeled as a spatial graph that corresponds to a graph convolution layer. As such, GCN-based spatial-temporal model is built at different time steps.
For robust action recognition, the GCN-based skeleton graph prefers to extract multi-scale structural features, which are set subjectively [25]. However, the artificially designed graph structures involve connections that may hinder human actions in actual motion, resulting in models that are insufficient to effectively capture the spatio-temporal joint dependencies of complex regions. For example, there is usually a close relationship between hands and legs in the "running" action. However, it is difficult to obtain this relationship by an artificially constructed multi-scale structure. The recent approaches based on the self-attention mechanism (SA) can be used to learn the interactions between joints [26,27]. However, these methods only verify the feasibility of SA experimentally. To address these problems, we propose a unified framework, namely SA-GCN, that integrates the SA into the GCN for low-quality motion video data with fixed viewing angle*. Compared with existing methods, this model fully exhibits the possible positive connections between different nodes, as shown in Figure 1.
*This dataset includes the raw data without regularization, which is introduced in ST-GCN [25].
To the best of our knowledge, this is the first study which pursues the impact of the SA mechanism on GCN in the area of action recognition (the latest and most representative study on SA-augmented GCNs was mainly to tackle the over-fitting and over-smoothing problems, e.g., see [28]). On the one hand, SA-GCN uses GCN as the basic component of the attention mechanism to extract the weights of joint point data in different frames of the same video. On the other hand, this component is beneficial to deepen the correlations of disconnected nodes of human joints, so that the learned long-distance temporal relationship is reflected in distant frames.
Extensive experiments on NTU-RGBD [29] and NTU120 [30] datasets demonstrate a superior performance of SA-GCN in terms of accruacy, and in comparison with a range of state of the art models across various skeleton-based action recognition tasks.
In summary, this paper claims three main contributions:
1) SA-GCN is a unified spatio-temporal graph convolution framework, which extracts better features by automatically learning the weight information between joint points at different scales. In addition, SA-GCN introduces a new SA mechanism with different scales for characterizing the multi-scale dependencies of joints, which greatly promotes the effectiveness in the learning of spatio-temporal features.
2) The proposed SA mechanism combines the structural informtion of the graph and the self-learning fusion features.
3) The proposed model achieves competitive performance for skeleton-based action recognition.
Graph neural networks (GNNs) have shown a great potential in computing irregular nodes in a graph [31]. Existing GNN models are mainly divided into two categories: spectral-based methods [32] and spatial-based approaches [27]. The former is based on feature decomposition, and performs convolution operations in the graph spectral domain. Thus the computational efficiency is limited due to the large amount of computation brought by feature decomposition. The latter (i.e., spatial-based GNNs) directly implements the operation of convolution on graph nodes and their neighbors in the spatial domain, thus is able to overcome the limitations of spectral-based methods. The method presented in this paper belongs to the second category.
Multi-scale GNNs are often used to capture the features for adjacent points. For example, [33] obtains image features by aggregating the higher-order polynomials of graph adjacency matrices. [34] represents multi-scale features by utilizing the graph adjacency matrix with a higher power. Nonetheless, these approaches inevitably and undesirably involve the artificial setting of structures for the adjacent points. Moreover, it is insufficient to rely only on the experimental verification that the setting of structures is optimal. Hence, the SA mechanism is proposed to lighten the problem of human interference.
Attention mechanism [26] is critical in human cognition, which has revolutionized natural language processing and has been applied in image recognition [35], image synthesis [36] and video processing [37]. It is well-known that the human visual system captures structural features through partial glimpse and salient objects, rather than processing the whole at once [38].
Recently, attention has been proposed to improve the performance of convolutional neural networks (CNNs) for large-scale classification tasks. For example, a residual attention network (RAN) was introduced in an end-to-end training fashion [39], in which the attention modules are stacked and attention-aware features change adaptively as layers go deeper. Similarly, the channel-wise attention is incorporated into squeeze-and-excitation networks, which adaptively recalibrates the global average-pooled features by explicitly using interdependencies between channels [40].
However, the attention mechanisms have rarely been used in GCNs for action recognition. Considering the proven advantages of attention in computer vision tasks [41], it is natural to integrate SA into GCN as done in this paper, which reinterprets mathematically the positional relationship of each feature fusion in the attention mechanism. Experiments of large-scale dataset further demonstrate that SA mechanism can bring improvements in performance.
Traditional methods for skeleton-based action recognition can be divided into two types: handcrafted feature-based models [42] and deep learning-based approaches [43]. However, the performance of handcrafted feature-based methods is often unsatisfactory since local factors are used and the most important structural connections of the human body are ignored. The emergence of deep learning-based methods significantly improve the performance of skeleton recognition. For example, spatial graph convolution and interleaved temporal convolution are used for spatio-temporal characterization of the skeleton data (ST-GCN) [25]. Similarly, a multi-scale module is introduced to implement graph convolution modeling by raising the graph adjacency matrix of skeleton to a higher power [44]. In addition, the model of multi-scale adjacency matrix can also be performed by generating human poses and adding spatial graph convolution [45]. In [27], by using video frame segmentation, a set of videos is divided into four groups according to time and space for processing and combination, which expands the receptive field and achieves a great success in computer vision.
Different from the above methods, the graph convolution method proposed in this paper uses a SA mechanism. More specifically, the proposed approach uses GCN as part of the SA mechanism instead of forcing them to be summed as in existing methods [26]. In this way, the proposed method can better re-express the weights of the connection matrix.
In this section, we present the proposed SA-GCN framework in detail.
Skeleton data usually consists of a sequence of frames, in which each frame has a set of human joint coordinates in 2D or 3D form [25]. Naturally, a spatial-temporal graph for skeleton data can be constructed, in which the nodes are human joint points and the edges include the natural connections of joints in human body structure over time in motion.
The skeleton data is formulated as a undirected spatial temporal graph G=(V,E), where V={vti|t=1,...,T,i=1...,N} is the set of nodes including all the joints, T denotes video frames which feature both intra-body and inter-frame connectons, N is the total number of joints and E is the edge set. Usually, E is split into two subsets. One represents the intra-skeleton connection in each frame and the other depicts the inter-frame edges, which corresponds to the same joint point connections in consecutive frames. In fact, the ST-GCN is composed of spatial graph convolution and temporal graph convolution.
In the spatial dimension, the graph convolution is manipulated on each node and the corresponding neighbor set. For each node vti, the neighbor set is given as:
BS(vti)={vtj|d(vtj,vti)≤D}, |
where d(vtj,vti) is the shortest distance of any path from vtj to vti. D=1 means the 1-neighbor set of joint nodes. That is, if there is a connection between vtj and vti, then d(vtj,vti) is 1, and if there is no connection, it is 0.
According to the spatial partition strategies introduced in [25], the above neighbor set is divided into three labels, including: 1) the root node itself; 2) the centripetal subset which contains the adjacent nodes closer to the center of gravity than the root; 3) the centrifugal subset, which includes the adjacent nodes that are further away from the center of gravity than the root, as shown in (a) of Figure 1 (red, blue and orange represent three division strategies respectively). Hence, the spatial graph convolution is defined as:
fout(vti)=∑vtj∈BS(vti)1Zti(vtj)fin(vtj)⋅W(Lti(vtj)) | (3.1) |
where fin(⋅) and fout(⋅) represent input and output features of this convolution layer, vtj denotes the j-th node of the graph on frame t, and BS(vti) is the sampling area of the convolution, characterized by the vertex with a distance of 1 from the target vertex. W(⋅) is the weighting function similar to convolution, which gives a weight vector based on the given data. Lti(⋅) is the label function, which allocates a label from 0 to K−1 to each node in BS(vti). Usually, we can set K = 3 [25]. Zti(⋅) denotes the number of nodes in the subset of BS(⋅) with the label Lti.
In practice, the skeletal sequence is usually denoted as a 3-order tensor of shape C×T×N, where C,T and N are respectively the numbers of channels, frames and joints. Thus, the implementation of Eq (3.1) can be transformed into the tensor format, such as fin∈RCin×T×N and fout∈RCout×T×N that represent the input and output feature maps, respectively. To perform multiplication of tensors, we reorder the elements of a tensor into a matrix through unfolding or flattening.
More technically, the connections between nodes in the skeleton graph are represented as an adjacency matrix A∈{0,1}N×N. The adjacency matrix corresponding to the k-th subset of the neighbor set BS(vti) can be denoted as Ak.
Let Fin∈RN×Cin be the input features of all joints in each frame, where Cin is the dimensionality of the input feature, and Fout∈RN×Cout is the output feature from the spatial graph convolution, where Cout is the dimensionality of output feature. Therefore, the implementation of the above Eq (3.1) can be rewritten as:
Fout=K−1∑k=0Mk⊙˜AkFinWk, | (3.2) |
where ˜Ak=Λ−12k(Ak+I)Λ−12k∈RN×N is the normalized adjacent matrix for each partition label. Λ∈RN×N is the diagonal degree matrix with Λiik=∑j(Aijk)+α, and α is a small number to avoid division by zero. ⊙ is the element-wise multiplication. Mk∈RN×N and Wk∈RCin×Cout are learnable weight matrices for each partition label, which capture the strength of connections (i.e., edge weights) and the feature importance respectively.
In the temporal dimension, since the graph is given by corresponding joints in the two adjacent frames, the number of neighbors for each node is 2. Accordingly, the temporal graph convolution is similar to the classical convolutional operation, which is performed on the T×N output feature matrix mentioned above. Generally, the kernel size is set to 9, as in [10,11,25]. However, it should be emphasized that the respective fields for the spatial and temporal dimensions are set artificially. Thus, it is necessary that a SA mechanism is enforced to reduce the dependance on human interference.
As illustrated in the previous Section 3.1, the graph convolution can be obtained by a standard 2D convolution and a multiplication between the resulting tensor and the normalized adjacency marix under the spatial temporal cases. To automatically capture the relationships within the matrix, we introduce a SA operation into ST-GCN to further focus on the important regions and increase the model's attention to important joint point information.
The SA function can be viewed as a mapping from the query and a set of key-value pairs to an output [8,26]. For an input tensor of shape (Cin,T,N), we flatten it to the corresponding matrix X∈RCinT×N and perform the single head attention. Hence, the output of the SA mechanism can be formulated as:
Attention(Q,K,V)=softmax((QKT)√dk)V, | (3.3) |
where WQ,WK∈RCin×dk and WV∈RCin×dk are learned linear transformations that map the input X to queries Q=XWQ, keys K=XWK and values V=XWV. dk is the depth of queries and keys.
Specifically, SA explores the correlations in a skeletal sequence, which are calculated for each position by the weighted sum over all positions. Moreover, the weight of each position in the similarity matrix is dynamic. Let x denote the input signal and y be the output. The SA module can be written as:
y(x)=σ(h(f(x)⋅g(x))+x), | (3.4) |
where f(x)=softmax(θ(x)Tϕ(x)) is used to generate the similarity matrix. θ(x)=Wθx, ϕ(x)=Wϕx, g(x)=Wgx, h(x)=Whx, and Wh, Wθ, Wϕ and Wg are learnable. σ(⋅) is the activation function. In fact, here the similarity matrix is a weighted adjacent matrix in Eq (3.2). Hence, Equation (3.4) is named as SA-GCN.
It is common sense for each action recognition category that the motion and nodes in different frames are different. Therefore, it is not enough to express the intra-skeleton connections in each frame and the inter-frame correlations by only learning a total joint point weight matrix as in ST-GCN (Eq (3.4)). To address this problem, we propose to learn and represent all weights of the node for each frame by the attention mechanism.
In traditional methods of attention, the input is usually expressed as the addition of the input matrix and the position variable, as shown in Figure 2(a). It is formally defined as:
xout=xin+A | (3.5) |
where A can be represented as the positional relationship of the input nodes, and xin and xout represent the original data and the fusion data with the position variable.
However, in this paper we do not use the simple addition to fuse the input data and location variables. Instead, as shown in Figure 1(b), feature fusion is expressed as:
xout=Dnormxin, | (3.6) |
where xin and xout represent the input and output matrices of the attention mechanism. Dnorm denotes the Laplacian regularization matrix, which represents the position information that can be formulated as Dnorm=Λ−12(A+I)Λ−12. A is the adjacency matrix, I is the identity matrix, and Λ is the degree matrix of the node. In this paper, Dnorm is implemented as Mk⊙˜Ak.
Compared to the traditional methods, the regularization matrix Dnorm in Eq (3.6) guarantees the adaptive choice of the position matrix instead of manual setting. Simultaneously, we utilize the multiplication between input matrix and the position matrix (i.e., Dnormfin), rahter than addition of the two in the traditional transformer architecture.
Next, we process the resulting matrix of Eq (3.6) to find the weight relationship between the different nodes of the input matrix. By integrating the attention module into Eq (3.6), the output result can be converted into multiple data matrices containing different information streams, as shown in Figure 3. Therefore, the conversion formula for input data is denoted as:
Q=DnormxinWQ,K=DnormxinWK,V=DnormxinWV, | (3.7) |
where WQ, WK and WV are the corresponding parameter matrices. dk in Eq (3.3) is defined as the row vector of the input data, that is, the number of joint points. Specifically, it is proved experimentally that using batchnorm in Eq (3.3) is more effective than the softmax function.
We then set Watt=Attention(Q,K,V). By using Eqs (3.3), (3.4) and (3.6), the proposed network can be written as:
fout=σ(Watt). | (3.8) |
We can see from Eq (3.8) that SA-GCN assembles a single module of the attention mechanism in a form similar to the spatial GCN (Eq (3.4)). Moreover, the proposed attention mechanism is heuristic since the mathematical formula of Eq (3.7) coincides with the input form Eq (3.6) proposed in this paper. The flow chart of SA-GCN is generalized in Figure 4, in which the proposed weight matrix of each frame is combinated with the original joint point weight matrix Mk of ST-GCN in Eq (3.2).
An important theoretical advantage of SA-GCN is that the attention mechanism in SA-GCN is interpretable. Specifically, the standard GCN is used as a component of the attention mechanism, which has a more theoretical derivation in the processing of data location variables. This enhances the dependency of the data on the existence of different joints. Specifically, the existing methods only describe the relationship between key nodes and adjacent joint points. However, the features processed by the proposed SA in this paper more fully demonstrate the positive influence between key points and all other nodes involved in the action (for example, the mapping strategy from (a) to (b) shown in Figure 1).
The previous subsection (i.e., Section 3.3) is focused on improving the spatial method in SA-GCN, for which the attention mechanism is used to extract the joint weights of different frames in a video. On the other hand, it is well recognized that temporal modeling is also essential for video action recognition. For temporal modeling, we adopt the classical 2D graph convolution with kernel size 9 as introduced in the original ST-GCN [25]. That is
fout=Cov2D[K+1][fin]. | (3.9) |
where D=1 and K=9.
To deploy the proposed SA-GCN on video data, we build a concrete network structure upon the spatio-temporal graph convolution, as shown in Figure 4(a). Specifically, the network utilizes 10 SA-GCNs to perform feature fusion and feature extraction on the input spatio-temporal skeleton data. And the feature information is aggregated by global average pooling. Finally, the prediction result is given by the fully connected layer and the softmax layer. Among them, each SA-T block in (b) of Figure 4 is composed of a temporal graph convolution and a spatial graph convolution respectively, to extract a mixture of spatio-temporal features, and it also uses the residual structure to to prevent data overfitting and improve the generalization ability of the model, as shown in Figure 4(b).
In summary, SA-GCN jointly learns the total weight matrix and the weight matrix of each frame, which are expected to greatly improve the performance of the original ST-GCN. The next section will present the experimental results and illustrate the potential of SA-GCN.
NTU-60 RGB+D Dataset. This is currently the largest and most widely used indoor action recognition dataset, which contains 56,000 action clips across 60 action categories [29]. The clips were performed by 40 volunteers in various age groups ranging from 10 to 35 years old. Each action was obtained by three cameras at the same height with different horizontal angles: −45∘, 0∘, and 45∘. This dataset includes location information of 3D joints for each frame detected by Kinect depth sensors, where 25 joints are contained for each subject of the skeleton sequences and each video has no more than 2 subjects. We evaluate the proposed model using two benchmarks derived from the NTU-60 RGB+D dataset, according to the metrics introduced in [25]. The two benchmarks are:
1) Cross-subject (X-Sub/CS): The dataset based on this benchmark is divided into a training set including 40,320 videos and a validation set containing 16,560 videos, where the persons within the videos are different.
2) Cross-view (X-View/CV): The dataset based on this benchmark has the cameras with different horizontal angles, where 37,920 videos in the training set are obtained from the angles (0∘ and 45∘), and 18,960 videos in the validation set are captured from the angle −45∘).
NTU-120 RGB+D Dataset. This is a large-scale dataset for 3D skeleton-based action recognition, which is obtained from 106 distinct subjects and contains more than 114 thousand video samples and 8 million frames [30]. In fact, this dataset can be seen as an expansion of the NTU-60 RGB+D dataset in the number of performers and action categories. It has 120 different action classes including daily actions, mutual behaviors, and health-related activities. Two specific benchmarks are derived from this dataset, namely X-Sub and cross-setup.
1) X-Sub: The dataset consists of training and testing groups, where each group contains 53 subjects.
2) X-View: The dataset divides samples into training and testing groups, in which the even setup samples are used for training and the odd setup samples are used for testing.
We follow this rule and give the top-1 accuracy about X-Sub and X-View in all experiments.
The model proposed in this paper is composed of nine SA-GCN stacks. The first 4 SA-GCN channels of the model are 64, the number of channels is doubled from the 5th to the 7th, and the last three are 256. The convolution stride is set to 2. For fairness in the experimental comparisons, the parameters and number of channels used in different comparative models are the same. Specifically, we train the model for 80 epochs using stochastic gradient descent (SGD) with Nesterov momentum (0.9), batch size 24, and initial learning rate 0.1. In the tenth and fifth iterations, the learning rate is reduced tenfold, respectively. For video data with more than two people, only the first two people are selected, and all skeleton data is filled to T = 300. In addition, this paper chooses a data preprocessing method similar to ST-GCN, in which only skeleton data is selected as the comparison data and the influence of other data streams is not considered. All experiments were conducted on PyTorch with 4 TITANX GPUs.
In this section, we perform our proposed method on the NTU-60 and NTU-120 datasets to compare with 3 previous state of the art approaches, which include ST-GCN [25], two-stream adaptive GCN (2s-AGCN) [27] and GCN-NAS† [46]. For fair comparisons, we use the data extraction method suggested by ST-GCN [25] in all experiments, which is raw data without preprocessing such as regularization.
†GCN-NAS: learning GCN for skeleton-based human action recognition by neural searching.
a ST-GCN [25]: It is a generic representation of skeleton sequences for action recognition by extending GCNs to a spatial-temporal graph model.
b 2s-AGCN [27]: It is the two-stream adaptive GCN for skeleton-based action recognition, which models both the first-order and the second-order information simultaneously to improve the acuuracy of recognition.
c GCN-NAS [46]: It is the automatically designed GCN for skeleton-based action recognition based on neural architecture search, which uses a sampling-and memory-efficient evolution strategy to find the optimal architecture for recognition.
d STAR [47]: It is an action recognition model based on sparse transformer.
e TSTE [48]: It is a two-stream transformer encoder network based on spatio-temporal feature and shape transformation.
f TAG [49]: It is a generalization of ST-GCN based on weak feature extraction.
The proposed model improves the spatial convolution in ST-GCN by adding a SA mechanism, which realizes the application of the attention module in GCN and strengthens the ability of the model to extract joint weights.
Table 1 summarizes the results on the two benchmarks of the NTU-60 dataset. The comparison is based on the same data processing method and the same skeleton data flow. We can find that SA-GCN gives the best results on X-view and X-Sub. For instance, compared to ST-GCN [25], the accuracy of SA-GCN is increased by about 3.2% on both X-View and X-Sub. On X-View, the accuracy of SA-GCN is 1.7% higher than that of 2s-AGCN [27] and GCN-NAS [46]. On X-Sub, SA-GCN achieves an improvement of 5.6% and 2.0% than 2s-AGCN [27] and GCN-NAS [46] respectively.
Methods | X-View | X-Sub |
ST-GCN[25] | 88.3 | 81.5 |
2s-AGCN[27] | 89.8 | 79.1 |
GCN-NAS[46] | 89.8 | 82.7 |
STAR[47] | 89.0 | 83.4 |
TSTE[48] | 85.3 | 80.5 |
TAG[49] | 90.0 | 82.1 |
SA-GCN(ours) | 91.5 | 84.7 |
The results on the NTU-120 are shown in Table 2. SA-GCN is able to outperform ST-GCN and 2s-AGCN on both X-View and X-Sub. Compared with GCN-NAS, the accuracy of the SA-GCN achieves suboptimal. This is due to the higher complexity of the GCN-NAS model. However, our model does not involve complex parameters and is basically the same as ST-GCN. Meanwhile, it should be emphasized that the attention mechanisms advocated in SA-GCN all share parameters.
Methods | X-View | X-Sub |
ST-GCN[25] | 78.1 | 75.5 |
2s-AGCN[27] | 77.5 | 78.0 |
GCN-NAS[46] | 80.5 | 78.0 |
STAR[47] | 80.2 | 78.3 |
TSTE[48] | 67.5 | 66.6 |
SA-GCN(ours) | 79.2 | 78.5 |
In this part, we give the influences of the SA-GCN on NTU60 dataset, which is shown in Table 3. As can be seen from these experiments, the proposed model with SA-GCN generates unanimously more promising results than the other methods.
SAM | GCN | X-View | X-Sub |
√ | × | 80.0 | 73.0 |
× | √ | 88.3 | 81.5 |
√ | √ | 91.5 | 84.7 |
In this work, we have proposed a unified spatio-temporal SA-GCN for low-quality motion video data with fixed viewing angle, where the designed SA module can be regarded as a GCN. Based on this module, the proposed model can extract features efficiently by learning weight between joint points of different scales. Furthermore, the designed SA mechanism not only characterizes the multi-scale dependencies of joints, but also integrates the structural features of the graph and the ability of self-learning fusion features. Moreover, since the parameters in the attention mechanism are shared, the total number of parameters of the model does not increase significantly. Extensive experiments on NTU-60 RGB+D and NTU-120 RGB+D datasets show that the proposed model achieves substantial improvements over mainstream methods. In the future, we will focus on optimization of the model structure and application in random motion videos.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
This work was supported in part by the National Nature Science Foundation of China (62072312, 62071303, 61972264), in part by Shenzhen Basis Research Project (JCYJ20210324094009026, JCYJ20200109105832261), the Project of Educational Commission of Guangdong Province (2023KTSCX116) and National Nature Science Foundation of Guangdong Province (2023A1515011394).
The authors declare there is no conflict of interest.
[1] |
Biyik E, Araz M, Hepbasli A, et al. (2017) A key review of building integrated photovoltaic (BIPV) systems. Jestech 20: 833–858. https://doi.org/10.1016/j.jestch.2017.01.009 doi: 10.1016/j.jestch.2017.01.009
![]() |
[2] |
Almulhim AI (2022) Understanding public awareness and attitudes toward renewable energy resources in Saudi Arabia. Renewable Energy 192: 572–582. https://doi.org/10.1016/j.renene.2022.04.122 doi: 10.1016/j.renene.2022.04.122
![]() |
[3] |
Zarate-Perez E, Sebastián R (2022) Autonomy evaluation model for a photovoltaic residential microgrid with a battery storage system. Energy Rep 8: 653–664. https://doi.org/10.1016/J.EGYR.2022.07.085 doi: 10.1016/J.EGYR.2022.07.085
![]() |
[4] |
Anderson TN, Duke M, Morrison GL, et al. (2009) Performance of a building integrated photovoltaic/thermal (BIPVT) solar collector. Sol Energy 83: 445–455. https://doi.org/10.1016/J.SOLENER.2008.08.013 doi: 10.1016/J.SOLENER.2008.08.013
![]() |
[5] |
Debbarma M, Sudhakar K, Baredar P (2017) Comparison of BIPV and BIPVT: A review. Resour-Effic Technol 3: 263–271. https://doi.org/10.1016/J.REFFIT.2016.11.013 doi: 10.1016/J.REFFIT.2016.11.013
![]() |
[6] |
Makki A, Omer S, Sabir H (2015) Advancements in hybrid photovoltaic systems for enhanced solar cells performance. Renewable Sustainable Energy Rev 41: 658–684. https://doi.org/10.1016/J.RSER.2014.08.069 doi: 10.1016/J.RSER.2014.08.069
![]() |
[7] |
Peng C, Huang Y, Wu Z (2011) Building-integrated photovoltaics (BIPV) in architectural design in China. Energy Build 43: 3592–3598. https://doi.org/10.1016/J.ENBUILD.2011.09.032 doi: 10.1016/J.ENBUILD.2011.09.032
![]() |
[8] |
Agathokleous RA, Kalogirou SA (2016) Double skin facades (DSF) and building integrated photovoltaics (BIPV): A review of configurations and heat transfer characteristics. Renewable Energy 89: 743–756. https://doi.org/10.1016/J.RENENE.2015.12.043 doi: 10.1016/J.RENENE.2015.12.043
![]() |
[9] |
Catto Lucchino E, Goia F, Lobaccaro G, et al. (2019) Modelling of double skin facades in whole-building energy simulation tools: A review of current practices and possibilities for future developments. Build Simul 12: 3–27. https://doi.org/10.1007/S12273-019-0511-Y doi: 10.1007/S12273-019-0511-Y
![]() |
[10] |
Shukla AK, Sudhakar K, Baredar P, et al. (2018) BIPV based sustainable building in South Asian countries. Sol Energy 170: 1162–1170. https://doi.org/10.1016/J.SOLENER.2018.06.026 doi: 10.1016/J.SOLENER.2018.06.026
![]() |
[11] |
Saretta E, Caputo P, Frontini F (2019) A review study about energy renovation of building facades with BIPV in urban environment. Sustain Cities Soc 44: 343–355. https://doi.org/10.1016/J.SCS.2018.10.002 doi: 10.1016/J.SCS.2018.10.002
![]() |
[12] |
Kuhn TE, Erban C, Heinrich M, et al. (2021) Review of technological design options for building integrated photovoltaics (BIPV). Energy Build 231: 110381. https://doi.org/10.1016/J.ENBUILD.2020.110381 doi: 10.1016/J.ENBUILD.2020.110381
![]() |
[13] |
Rounis ED, Athienitis A, Stathopoulos T (2021) Review of air-based PV/T and BIPV/T systems-Performance and modelling. Renewable Energy 163: 1729–1753. https://doi.org/10.1016/J.RENENE.2020.10.085 doi: 10.1016/J.RENENE.2020.10.085
![]() |
[14] |
Yang T, Athienitis AK (2016) A review of research and developments of building-integrated photovoltaic/thermal (BIPV/T) systems. Renewable Sustainable Energy Rev 66: 886–912. https://doi.org/10.1016/J.RSER.2016.07.011 doi: 10.1016/J.RSER.2016.07.011
![]() |
[15] |
Shen Y, Ji L, Xie Y, et al. (2021) Research landscape and hot topics of rooftop PV: A bibliometric and network analysis. Energy Build 251: 111333. https://doi.org/10.1016/J.ENBUILD.2021.111333 doi: 10.1016/J.ENBUILD.2021.111333
![]() |
[16] |
Li X, Zhou Y, Xue L, et al. (2015) Integrating bibliometrics and roadmapping methods: A case of dye-sensitized solar cell technology-based industry in China. Technol Forecast Soc Change 97: 205–222. https://doi.org/10.1016/J.TECHFORE.2014.05.007 doi: 10.1016/J.TECHFORE.2014.05.007
![]() |
[17] |
Pillai DS, Shabunko V, Krishna A (2022) A comprehensive review on building integrated photovoltaic systems: Emphasis to technological advancements, outdoor testing, and predictive maintenance. Renewable Sustainable Energy Rev 156: 111946. https://doi.org/10.1016/J.RSER.2021.111946 doi: 10.1016/J.RSER.2021.111946
![]() |
[18] |
Martín-Chivelet N, Kapsis K, Wilson HR, et al. (2022) Building-integrated photovoltaic (BIPV) products and systems: A review of energy-related behavior. Energy Build 262: 111998. https://doi.org/10.1016/j.enbuild.2022.111998 doi: 10.1016/j.enbuild.2022.111998
![]() |
[19] | Salazar-Concha C, Ficapal-Cusí P, Boada-Grau J, et al. (2021) Analyzing the evolution of technostress: A science mapping approach. Heliyon 7: e06726. https://doi.org/10.1016/J.HELIYON.2021.E06726 |
[20] |
Cobo MJ, Lõpez-Herrera AG, Herrera-Viedma E, et al. (2012) SciMAT: A new science mapping analysis software tool. J Am Soc Inf Sci Technol 63: 1609–1630. https://doi.org/10.1002/asi.22688 doi: 10.1002/asi.22688
![]() |
[21] |
Cobo MJ, López-Herrera AG, Herrera-Viedma E, et al. (2011) Science mapping software tools: Review, analysis, and cooperative study among tools. J Am Soc Inf Sci Technol 62: 1382–1402. https://doi.org/10.1002/asi.21525 doi: 10.1002/asi.21525
![]() |
[22] |
Zarate-Perez E, Rosales-Asensio E, González-Martínez A, et al. (2022) Battery energy storage performance in microgrids: A scientific mapping perspective. Energy Rep 8: 259–268. https://doi.org/10.1016/J.EGYR.2022.06.116 doi: 10.1016/J.EGYR.2022.06.116
![]() |
[23] |
Azad AK, Parvin S (2022) Bibliometric analysis of photovoltaic thermal (PV/T) system: From citation mapping to research agenda. Energy Rep 8: 2699–2711. https://doi.org/10.1016/J.EGYR.2022.01.182 doi: 10.1016/J.EGYR.2022.01.182
![]() |
[24] |
Naseer MN, Zaidi AA, Dutta K, et al. (2022) Past, present and future of materials' applications for CO2 capture: A bibliometric analysis. Energy Rep 8: 4252–4264. https://doi.org/10.1016/J.EGYR.2022.02.301 doi: 10.1016/J.EGYR.2022.02.301
![]() |
[25] |
Chen C, Chitose A, Kusadokoro M, et al. (2021) Sustainability and challenges in biodiesel production from waste cooking oil: An advanced bibliometric analysis. Energy Rep 7: 4022–4034. https://doi.org/10.1016/J.EGYR.2021.06.084 doi: 10.1016/J.EGYR.2021.06.084
![]() |
[26] |
Sánchez AD, de la Cruz Del Río Rama M, García JÁ (2017) Bibliometric analysis of publications on wine tourism in the databases Scopus and WoS. ERMBE 23: 8–15. https://doi.org/10.1016/J.IEDEEN.2016.02.001 doi: 10.1016/J.IEDEEN.2016.02.001
![]() |
[27] | Zarate-Perez E, Sebastián R, Grados J (2021) Online labs: A perspective based on bibliometric analysis. 19th LACCEI International Multi-Conference for Engineering, Education Caribbean Conference for Engineering and Technology 175610. https://doi.org/10.18687/LACCEI2021.1.1.267 |
[28] |
Moral-Muñoz JA, Cobo MJ, Peis E, et al. (2014) Analyzing the research in integrative & complementary medicine by means of science mapping. Complement Ther Med 22: 409–418. https://doi.org/10.1016/j.ctim.2014.02.003 doi: 10.1016/j.ctim.2014.02.003
![]() |
[29] |
Ellegaard O, Wallin JA (2015) The bibliometric analysis of scholarly production: How great is the impact? Scientometrics 105: 1809–1831. https://doi.org/10.1007/S11192-015-1645-Z/TABLES/9 doi: 10.1007/S11192-015-1645-Z/TABLES/9
![]() |
[30] |
López-Robles JR, Cobo MJ, Gamboa-Rosales NK, et al. (2021) Mapping the intellectual structure of the international journal of computers communications and control: A content analysis from 2015 to 2019. Adv Intell Syst Comput 1243: 296–303. https://doi.org/10.1007/978-3-030-53651-0_25 doi: 10.1007/978-3-030-53651-0_25
![]() |
[31] |
Gutiérrez-Salcedo M, Martínez MÁ, Moral-Munoz JA, et al. (2017) Some bibliometric procedures for analyzing and evaluating research fields. Appl Intell 48: 1275–1287. https://doi.org/10.1007/S10489-017-1105-Y doi: 10.1007/S10489-017-1105-Y
![]() |
[32] |
Hadj Arab A, Taghezouit B, Abdeladim K, et al. (2020) Maximum power output performance modeling of solar photovoltaic modules. Energy Rep 6: 680–686. https://doi.org/10.1016/J.EGYR.2019.09.049 doi: 10.1016/J.EGYR.2019.09.049
![]() |
[33] |
Li H, Huang J, Wang H, et al. (2021) Effects of receiver parameters on the optical efficiency of a fixed linear-focus Fresnel lens solar system with sliding adjustment. Energy Rep 7: 3348–3361. https://doi.org/10.1016/J.EGYR.2021.05.072 doi: 10.1016/J.EGYR.2021.05.072
![]() |
[34] |
Li B, Zhao A, Xiang D, et al. (2022) Smooth Cu electrodeposition for Cu (In, Ga) Se2 thin-film solar cells: Dendritic clusters elimination by Ag buffer layer. Energy Rep 8: 1847–1852. https://doi.org/10.1016/J.EGYR.2021.12.079 doi: 10.1016/J.EGYR.2021.12.079
![]() |
[35] |
Dey D, Subudhi B (2020) Design, simulation and economic evaluation of 90 kW grid connected Photovoltaic system. Energy Rep 6: 1778–1787. https://doi.org/10.1016/J.EGYR.2020.04.027 doi: 10.1016/J.EGYR.2020.04.027
![]() |
[36] |
Zsiborács H, Zentkó L, Pintér G, et al. (2021) Assessing shading losses of photovoltaic power plants based on string data. Energy Rep 7: 3400–3409. https://doi.org/10.1016/J.EGYR.2021.05.038 doi: 10.1016/J.EGYR.2021.05.038
![]() |
[37] |
Shukla AK, Sudhakar K, Baredar P, et al. (2017) BIPV in Southeast Asian countries-opportunities and challenges. Renew Energy Focus 21: 25–32. https://doi.org/10.1016/J.REF.2017.07.001 doi: 10.1016/J.REF.2017.07.001
![]() |
[38] |
Ghosh A, Sarmah N, Sundaram S, et al. (2019) Numerical studies of thermal comfort for semi-transparent building integrated photovoltaic (BIPV)-vacuum glazing system. Sol Energy 190: 608–616. https://doi.org/10.1016/J.SOLENER.2019.08.049 doi: 10.1016/J.SOLENER.2019.08.049
![]() |
[39] |
Kant K, Anand A, Shukla A, et al. (2020) Heat transfer study of building integrated photovoltaic (BIPV) with nano-enhanced phase change materials. J Energy Storage 30: 101563 https://doi.org/10.1016/j.est.2020.101563 doi: 10.1016/j.est.2020.101563
![]() |
[40] |
Assoa YB, Sauzedde F, Boillot B (2018) Numerical parametric study of the thermal and electrical performance of a BIPV/T hybrid collector for drying applications. Renewable Energy 129: 121–131. https://doi.org/10.1016/J.RENENE.2018.05.102 doi: 10.1016/J.RENENE.2018.05.102
![]() |
[41] |
Yang S, Cannavale A, Prasad D, et al. (2019) Numerical simulation study of BIPV/T double-skin facade for various climate zones in Australia: Effects on indoor thermal comfort. Build Simul 12: 51–67. https://doi.org/10.1007/S12273-018-0489-X doi: 10.1007/S12273-018-0489-X
![]() |
[42] |
Batayneh W, Bataineh A, Soliman I (2019) Investigation of solar tracking performance using isotropic and anisotropic models. Adv Build Energy Res 15: 390–408. https://doi.org/10.1080/17512549.2019.1625810 doi: 10.1080/17512549.2019.1625810
![]() |
[43] | Wang Y, Gawryszewska-Wilczynsk P, Zhang X, et al. (2020) Photovoltaic efficiency enhancement of polycrystalline silicon solar cells by a highly stable luminescent film. Sci China Mater 63: 544–551. https://doi.org/10.1007/S40843-019-1246-5 |
[44] | Kim S, Quy HV, Bark CW (2021) Photovoltaic technologies for flexible solar cells: beyond silicon. Mater Today Energy 19: 100583. https://doi.org/10.1016/J.MTENER.2020.100583 |
[45] |
Jun Huang M (2011) The effect of using two PCMs on the thermal regulation performance of BIPV systems. Sol Energy Mater Sol Cells 95: 957–963. https://doi.org/10.1016/J.SOLMAT.2010.11.032 doi: 10.1016/J.SOLMAT.2010.11.032
![]() |
[46] |
Agathokleous RA, Kalogirou SA (2018) Part Ⅱ: Thermal analysis of naturally ventilated BIPV system: Modeling and simulation. Sol Energy 169: 682–691. https://doi.org/10.1016/J.SOLENER.2018.02.057 doi: 10.1016/J.SOLENER.2018.02.057
![]() |
[47] | Li X, Li P, Wu Z, et al. (2021) Review and perspective of materials for flexible solar cells. Mater Rep: Energy, 100001. https://doi.org/10.1016/J.MATRE.2020.09.001 |
[48] |
Aslani A, Bakhtiar A, Akbarzadeh MH (2021) Energy-efficiency technologies in the building envelope: Life cycle and adaptation assessment. J Build Eng 21: 55–63. https://doi.org/10.1016/J.JOBE.2018.09.014 doi: 10.1016/J.JOBE.2018.09.014
![]() |
[49] |
Gholami H, Rø stvik HN (2020) Economic analysis of BIPV systems as a building envelope material for building skins in Europe. Energy 204: 117931. https://doi.org/10.1016/J.ENERGY.2020.117931 doi: 10.1016/J.ENERGY.2020.117931
![]() |
[50] |
Shukla AK, Sudhakar K, Baredar P (2017) Recent advancement in BIPV product technologies: A review. Energy Build 140: 188–95. https://doi.org/10.1016/J.ENBUILD.2017.02.015 doi: 10.1016/J.ENBUILD.2017.02.015
![]() |
[51] | Shukla KN, Rangnekar S, Sudhakar K (2015) Mathematical modelling of solar radiation incident on tilted surface for photovoltaic application at Bhopal, M.P., India. Int J Ambient Energy 37: 579–588. https://doi.org/10.1080/01430750.2015.1023834 |
[52] |
Shukla AK, Sudhakar K, Baredar P (2016) Exergetic analysis of building integrated semitransparent photovoltaic module in clear sky condition at Bhopal India. Case Stud Therm Eng 8: 142–151. https://doi.org/10.1016/J.CSITE.2016.06.009 doi: 10.1016/J.CSITE.2016.06.009
![]() |
[53] |
Skandalos N, Wang M, Kapsalis V, et al. (2022) Building PV integration according to regional climate conditions: BIPV regional adaptability extending Köppen-Geiger climate classification against urban and climate-related temperature increases. Renewable Sustainable Energy Rev 169: 112950. https://doi.org/10.1016/j.rser.2022.112950 doi: 10.1016/j.rser.2022.112950
![]() |
[54] | Bailey HP (1979) Semi-Arid climates: Their definition and distribution, In: Hall, A.E., Cannell, G.H., Lawton, H.W. Agriculture in Semi-Arid Environments, Eds, Springer, Berlin, Heidelberg, 73–97. https://doi.org/10.1007/978-3-642-67328-3_3 |
[55] |
Tsutsumi H, Tanabe S ichi, Harigaya J, et al. (2007) Effect of humidity on human comfort and productivity after step changes from warm and humid environment. Build Environ 42: 4034–4042. https://doi.org/10.1016/J.BUILDENV.2006.06.037 doi: 10.1016/J.BUILDENV.2006.06.037
![]() |
[56] |
Loader NJ, Santillo PM, Woodman-Ralph JP, et al. (2008) Multiple stable isotopes from oak trees in southwestern Scotland and the potential for stable isotope dendroclimatology in maritime climatic regions. Chem Geol 252: 62–71. https://doi.org/10.1016/J.CHEMGEO.2008.01.006 doi: 10.1016/J.CHEMGEO.2008.01.006
![]() |
[57] |
Barthwal M, Rakshit D (2021) Artificial neural network coupled building-integrated photovoltaic thermal system for indian montane climate. Energy Convers Manage 244: 114488. https://doi.org/10.1016/j.enconman.2021.114488 doi: 10.1016/j.enconman.2021.114488
![]() |
[58] | Li C, Zhang W, Tan J, et al. (2023) Energy performance of an innovative bifacial photovoltaic sunshade (BiPVS) under hot summer and warm winter climate. Heliyon 9: e18700. https://doi.org/10.1016/j.heliyon.2023.e18700 |
[59] | Sorgato MJ, Schneider K, Rüther R (2018) Technical and economic evaluation of thin-film CdTe building-integrated photovoltaics (BIPV) replacing façade and rooftop materials in office buildings in a warm and sunny climate. Renewable Energy 118: 84–98. https://doi.org/doi:10.1016/j.renene.2017.10.091 |
[60] |
An HJ, Yoon JH, An YS, et al. (2018) Heating and cooling performance of office buildings with a-Si BIPV windows considering operating conditions in temperate climates: The case of Korea. Sustainability 10: 4856. https://doi.org/10.3390/su10124856 doi: 10.3390/su10124856
![]() |
[61] |
Rodriguez-Ubinas E, Alhammadi N, Alantali M, et al. (2020) Building integrated photovoltaic solutions in arid climates: Experimental analysis of copper indium gallium selenide and crystalline silicon modules integrated as ventilated façades. WIT Trans Built Environ 195: 115–123. https://doi.org/10.2495/ARC200091 doi: 10.2495/ARC200091
![]() |
[62] |
Yang T, Athienitis AK (2015) Performance evaluation of air-based building integrated photovolta-ic/thermal (BIPV/T) system with multiple inlets in a cold climate. Procedia Eng 121: 2060–2067. https://doi.org/10.1016/j.proeng.2015.09.207 doi: 10.1016/j.proeng.2015.09.207
![]() |
[63] |
Dash A, Agrawal S, Gairola S, et al. (2018) Optimization and performance characteristics of building integrated photovoltaic thermal (BIPVT) system in cold climatic conditions. Asian J Water Environ Pollut 15: 63–72. https://doi.org/10.3233/AJW-180044 doi: 10.3233/AJW-180044
![]() |
[64] |
Hailu G, Dash P, Fung AS (2015) Performance evaluation of an air source heat pump coupled with a building-integrated photovoltaic/thermal (BIPV/T) system under cold climatic conditions. Energy Procedia 78: 1913–1918. https://doi.org/10.1016/j.egypro.2015.11.370 doi: 10.1016/j.egypro.2015.11.370
![]() |
[65] |
Do SL, Shin M, Baltazar JC, et al. (2017) Energy benefits from semi-transparent BIPV window and daylight-dimming systems for IECC code-compliance residential buildings in hot and humid climates. Sol Energy 155: 291–303. https://doi.org/10.1016/j.solener.2017.06.039 doi: 10.1016/j.solener.2017.06.039
![]() |
[66] |
Alhammadi N, Rodriguez-Ubinas E, Alzarouni S, et al. (2022) Building-integrated photovoltaics in hot climates: Experimental study of CIGS and c-Si modules in BIPV ventilated facades. Energy Convers Manage 274: 116408. https://doi.org/10.1016/j.enconman.2022.116408 doi: 10.1016/j.enconman.2022.116408
![]() |
[67] |
Ardiani NA, Suhendri, Koerniawan MD, et al (2019) Application of building integrated photovoltaic in hot humid climate. Case study: office building in Indonesia. IOP Conf Ser Earth Environ Sci 291: 012026. https://doi.org/10.1088/1755-1315/291/1/012026 doi: 10.1088/1755-1315/291/1/012026
![]() |
[68] |
Brennan DA, White C, Barclay M, et al. (2019) Performance characterisation and optimisation of a building integrated photovoltaic (BIPV) system in a maritime climate. Futur Cities Environ 5: 1–9. https://doi.org/doi:10.5334/fce.62 doi: 10.5334/fce.62
![]() |
[69] |
Bot K, Aelenei L, Gonçalves H, et al. (2021) Performance assessment of a building-integrated photovoltaic thermal system in a mediterranean climate-an experimental analysis approach. Energies 14: 2191. https://doi.org/doi:10.3390/en14082191 doi: 10.3390/en14082191
![]() |
[70] |
Salem T, Kinab E (2015) Analysis of building-integrated photovoltaic systems: A case study of commercial buildings under mediterranean climate. Procedia Eng 118: 538–545. https://doi.org/doi:10.1016/j.proeng.2015.08.473 doi: 10.1016/j.proeng.2015.08.473
![]() |
[71] |
Abdelhakim M, Kandar MZ, Lim YW (2019) Experimental investigation of overall energy performance in Algerian office building integrated photovoltaic window under semi-arid climate. J Daylighting 6: 23–41. https://doi.org/10.15627/jd.2019.3 doi: 10.15627/jd.2019.3
![]() |
[72] |
Mesloub A, Albaqawy GA, Kandar MZ (2020) The optimum performance of building integrated photovoltaic (BIPV) windows under a semi-arid climate in Algerian office buildings. Sustainability 12: 1654. https://doi.org/10.3390/su12041654 doi: 10.3390/su12041654
![]() |
[73] |
Pan D, Yu X, Zhou Y (2023) Cradle-to-grave lifecycle carbon footprint analysis and frontier decarbonization pathways of district buildings in subtropical Guangzhou, China. J Cleaner Prod 416: 137921. https://doi.org/10.1016/j.jclepro.2023.137921 doi: 10.1016/j.jclepro.2023.137921
![]() |
[74] |
Zhou Y (2022) Demand response flexibility with synergies on passive PCM walls, BIPVs, and active air-conditioning system in a subtropical climate. Renewable Energy 199: 204–225. https://doi.org/10.1016/j.renene.2022.08.128 doi: 10.1016/j.renene.2022.08.128
![]() |
[75] |
Evola G, Margani G (2016) Renovation of apartment blocks with BIPV: Energy and economic evaluation in temperate climate. Energy Build 130: 794–810. https://doi.org/10.1016/j.enbuild.2016.08.085 doi: 10.1016/j.enbuild.2016.08.085
![]() |
[76] |
Alrashidi H, Ghosh A, Issa W, et al. (2020) Thermal performance of semitransparent CdTe BIPV window at temperate climate. Sol Energy 195: 536–543. https://doi.org/10.1016/j.solener.2019.11.084 doi: 10.1016/j.solener.2019.11.084
![]() |
[77] |
Mangkuto RA, Tresna DNAT, Hermawan IM, et al. (2023) Experiment and simulation to determine the optimum orientation of building-integrated photovoltaic on tropical building façades considering annual daylight performance and energy yield. EBE 3: 414–425. https://doi.org/10.1016/j.enbenv.2023.01.002 doi: 10.1016/j.enbenv.2023.01.002
![]() |
[78] |
Humada AM, Hojabri M, Hamada HM, et al. (2016) Performance evaluation of two PV technologies (c-Si and CIS) for building integrated photovoltaic based on tropical climate condition: A case study in Malaysia. Energy Build 119: 233–241. https://doi.org/10.1016/j.enbuild.2016.03.052 doi: 10.1016/j.enbuild.2016.03.052
![]() |
[79] |
Jhumka H, Yang S, Gorse C, et al. (2023) Assessing heat transfer characteristics of building envelope deployed BIPV and resultant building energy consumption in a tropical climate. Energy Build 298: 113540. https://doi.org/10.1016/j.enbuild.2023.113540 doi: 10.1016/j.enbuild.2023.113540
![]() |
[80] |
Ekoe A Akata AM, Njomo D, Agrawal B (2017) Assessment of building integrated photovoltaic (BIPV) for sustainable energy performance in tropical regions of Cameroon. Renewable Sustainable Energy Rev 80: 1138–1152. https://doi.org/10.1016/j.rser.2017.05.155 doi: 10.1016/j.rser.2017.05.155
![]() |
[81] |
Hamzah AH, Go YI (2023) Design and assessment of building integrated PV (BIPV) system towards net zero energy building for tropical climate. e-Prim 3: 100105. https://doi.org/10.1016/j.prime.2022.100105 doi: 10.1016/j.prime.2022.100105
![]() |
[82] |
Mendis T, Huang Z, Xu S (2020) Determination of economically optimised building integrated photovoltaic systems for utilisation on facades in the tropical climate: A case study of Colombo, Sri Lanka. Build Simul 13: 171–83. https://doi.org/10.1007/s12273-019-0579-4 doi: 10.1007/s12273-019-0579-4
![]() |
[83] |
Shetty S, Bajpai V, Bysani S, et al. (2022) Impact of BIPV panels across various window-to-wall ratios in commercial buildings, to reduce its energy performance index in warm and humid climate zone of India. Commun Comput Inf Sci 1612: 151–172. https://doi.org/10.1007/978-3-031-17098-0_8 doi: 10.1007/978-3-031-17098-0_8
![]() |
[84] | Nibandhe A, Bonyadi N, Rounis E, et al. (2019) Design of a coupled BIPV/T-solid desiccant cooling system for a warm and humid climate. SWC/SHC 2019 Proceedings, 2670–2680. https://doi.org/10.18086/swc.2019.55.10 |
[85] |
Braun P, Rüther R (2010) The role of grid-connected, building-integrated photovoltaic generation in commercial building energy and power loads in a warm and sunny climate. Energy Convers Manage 51: 2457–2466. https://doi.org/10.1016/j.enconman.2010.04.013 doi: 10.1016/j.enconman.2010.04.013
![]() |
[86] |
Rüther R, Braun P (2009) Energetic contribution potential of building-integrated photovoltaics on airports in warm climates. Sol Energy 83: 1923–1931. https://doi.org/10.1016/j.solener.2009.07.014 doi: 10.1016/j.solener.2009.07.014
![]() |
[87] |
Zheng X, Zhou Y (2023) A three-dimensional unsteady numerical model on a novel aerogel-based PV/T-PCM system with dynamic heat-transfer mechanism and solar energy harvesting analysis. Appl Energy 338: 120899. https://doi.org/10.1016/j.apenergy.2023.120899 doi: 10.1016/j.apenergy.2023.120899
![]() |
[88] |
Zhou Y (2022) Artificial intelligence in renewable systems for transformation towards intelligent buildings. EAI 10: 100182. https://doi.org/10.1016/j.egyai.2022.100182 doi: 10.1016/j.egyai.2022.100182
![]() |
SAM | GCN | X-View | X-Sub |
√ | × | 80.0 | 73.0 |
× | √ | 88.3 | 81.5 |
√ | √ | 91.5 | 84.7 |
Methods | X-View | X-Sub |
ST-GCN[25] | 88.3 | 81.5 |
2s-AGCN[27] | 89.8 | 79.1 |
GCN-NAS[46] | 89.8 | 82.7 |
STAR[47] | 89.0 | 83.4 |
TSTE[48] | 85.3 | 80.5 |
TAG[49] | 90.0 | 82.1 |
SA-GCN(ours) | 91.5 | 84.7 |
Methods | X-View | X-Sub |
ST-GCN[25] | 78.1 | 75.5 |
2s-AGCN[27] | 77.5 | 78.0 |
GCN-NAS[46] | 80.5 | 78.0 |
STAR[47] | 80.2 | 78.3 |
TSTE[48] | 67.5 | 66.6 |
SA-GCN(ours) | 79.2 | 78.5 |
SAM | GCN | X-View | X-Sub |
√ | × | 80.0 | 73.0 |
× | √ | 88.3 | 81.5 |
√ | √ | 91.5 | 84.7 |