
Hepatitis B is a major global challenge, but there is a lack of epidemiological research on hepatitis B incidence from a change point perspective. This study aimed to fill this gap by identifying significant change points and trends in hepatitis time series in Xinjiang, China. The datasets were obtained from the Xinjiang Information System for Disease Control and Prevention. The Mann-Kendall-Sneyers (MKS) test was used to detect change points and trend changes on the hepatitis B time series of 14 regions in Xinjiang, and the effectiveness of this method was validated by comparing it with the binary segmentation (BS) and segment regression (SR) methods. Based on the results of change point analysis, the prevention and control policies and measures of hepatitis in Xinjiang were discussed. The results showed that 8 regions (57.1%) with at least one change fell within the 95% confidence interval (CI) in all 14 regions by the MKS test, where five regions (Turpan (TP), Hami (HM), Bayingolin (BG), Kyzylsu Kirgiz (KK), Altai (AT)) were identified at one change point, two change points existed for two regions (Aksu (AK), Hotan (HT)) and three change points was detected in 1 region (Bortala (BT)). Most of the change points occurred at both ends of the sequence. More change points indicated an upward trend in the front half of the sequence, while in the latter half, many change points indicated a downward trend prominently. Finally, in comparing the results of the three change point tests, the MKS test showed a 61.5% agreement (8/13) with the BS and SR.
Citation: Liping Yang, Na Xie, Yanru Yao, Chunxia Wang, Maozai Tian, Kai Wang. Hepatitis B time series in Xinjiang, China (2006–2021): change point detection based on the Mann-Kendall-Sneyers test[J]. Mathematical Biosciences and Engineering, 2024, 21(2): 2458-2469. doi: 10.3934/mbe.2024108
[1] | Kieu Huu Dung, Do Lu Cong Minh, Pham Thi Kim Thuy . Commutators of Hardy-Cesàro operators on Morrey-Herz spaces with variable exponents. AIMS Mathematics, 2022, 7(10): 19147-19166. doi: 10.3934/math.20221051 |
[2] | Muhammad Asim, Ghada AlNemer . Boundedness on variable exponent Morrey-Herz space for fractional multilinear Hardy operators. AIMS Mathematics, 2025, 10(1): 117-136. doi: 10.3934/math.2025007 |
[3] | Muhammad Asim, Ghada AlNemer . Analytical findings on bilinear fractional Hardy operators in weighted central Morrey spaces with variable exponents. AIMS Mathematics, 2025, 10(5): 10431-10451. doi: 10.3934/math.2025475 |
[4] | Chenchen Niu, Hongbin Wang . N-dimensional fractional Hardy operators with rough kernels on central Morrey spaces with variable exponents. AIMS Mathematics, 2023, 8(5): 10379-10394. doi: 10.3934/math.2023525 |
[5] | Babar Sultan, Mehvish Sultan, Aziz Khan, Thabet Abdeljawad . Boundedness of an intrinsic square function on grand p-adic Herz-Morrey spaces. AIMS Mathematics, 2023, 8(11): 26484-26497. doi: 10.3934/math.20231352 |
[6] | Yueping Zhu, Yan Tang, Lixin Jiang . Boundedness of multilinear Calderón-Zygmund singular operators on weighted Lebesgue spaces and Morrey-Herz spaces with variable exponents. AIMS Mathematics, 2021, 6(10): 11246-11262. doi: 10.3934/math.2021652 |
[7] | Javeria Younas, Amjad Hussain, Hadil Alhazmi, A. F. Aljohani, Ilyas Khan . BMO estimates for commutators of the rough fractional Hausdorff operator on grand-variable-Herz-Morrey spaces. AIMS Mathematics, 2024, 9(9): 23434-23448. doi: 10.3934/math.20241139 |
[8] | Muhammad Asim, Ghada AlNemer . Results for fractional bilinear Hardy operators in central varying exponent Morrey space. AIMS Mathematics, 2024, 9(11): 29689-29706. doi: 10.3934/math.20241438 |
[9] | Heng Yang, Jiang Zhou . Compactness of commutators of fractional integral operators on ball Banach function spaces. AIMS Mathematics, 2024, 9(2): 3126-3149. doi: 10.3934/math.2024152 |
[10] | Pham Thi Kim Thuy, Kieu Huu Dung . Hardy–Littlewood maximal operators and Hausdorff operators on p-adic block spaces with variable exponents. AIMS Mathematics, 2024, 9(8): 23060-23087. doi: 10.3934/math.20241121 |
Hepatitis B is a major global challenge, but there is a lack of epidemiological research on hepatitis B incidence from a change point perspective. This study aimed to fill this gap by identifying significant change points and trends in hepatitis time series in Xinjiang, China. The datasets were obtained from the Xinjiang Information System for Disease Control and Prevention. The Mann-Kendall-Sneyers (MKS) test was used to detect change points and trend changes on the hepatitis B time series of 14 regions in Xinjiang, and the effectiveness of this method was validated by comparing it with the binary segmentation (BS) and segment regression (SR) methods. Based on the results of change point analysis, the prevention and control policies and measures of hepatitis in Xinjiang were discussed. The results showed that 8 regions (57.1%) with at least one change fell within the 95% confidence interval (CI) in all 14 regions by the MKS test, where five regions (Turpan (TP), Hami (HM), Bayingolin (BG), Kyzylsu Kirgiz (KK), Altai (AT)) were identified at one change point, two change points existed for two regions (Aksu (AK), Hotan (HT)) and three change points was detected in 1 region (Bortala (BT)). Most of the change points occurred at both ends of the sequence. More change points indicated an upward trend in the front half of the sequence, while in the latter half, many change points indicated a downward trend prominently. Finally, in comparing the results of the three change point tests, the MKS test showed a 61.5% agreement (8/13) with the BS and SR.
Emotion is a reflection of a person's psychological or physiological state. It guides people's behavior to a certain extent. In daily life, emotions always play a very important role, especially in human perception and decision making [1]. Emotion analysis is an interdisciplinary research, involving biology, computer science, artificial intelligence and other fields. It is widely used in medical monitoring, safe driving, product design and other fields.
At present, there are two ways of emotion recognition. One kind is based on the non-physiological signal, through the external behavior of the human body, such as tone of voice, facial expression, body posture, text, etc. [2]. The non-physiological signal information is easy to collect. Natural language processing and speech recognition has been widely used in the field of machine learning research. Technology is relatively mature. Satisfactory results have been obtained in emotion recognition of non-physiological signals. However, external signals may be disguised. Because people can control their own behavior. Even text data will be affected by regional and cultural differences, so the accuracy of emotion recognition of non-physiological signals cannot be guaranteed. The other is emotion recognition based on physiological signals. The common physiological signals such as electrocardiogram, EEG, electromyogram [3]. Compared with the emotion generated by external signals, physiological signals are controlled by the central nervous system and cannot be manipulated subjectively. So the reflection of emotions is more objective, authentic and reliable. Among these physiological signals, EEG signals are mostly used.
In the aspect of EEG emotion recognition, the early studies mainly used the combination of feature extraction and shallow machine learning algorithms. Hosseini and Naghibi-Sistani [4] used pictures from the International Affective Picture System (IAPS) to stimulate subjects, and extracted approximate entropy and wavelet entropy from the EEG signals of five channels. The classification was carried out by support vector machine (SVM). The accuracy was 73.25% in two emotional states, calm-neutral and negatively excited. Daimi and Saha [5] used Dual-Tree Complex Wavelet Packet Transform to extract time-frequency features. The SVM was used in emotion recognition. DEAP dataset is used to validate the method. The identification accuracy was 65.3, 66.9, 71.2 and 69.1% in the four dimensions of valance, arousal, liking and dominance, respectively. Mehmood and Lee [6] extracted the time domain and frequency domain features of Hjorth parameters. Particle swarm optimization (PSO) and genetic search (GS) were used for feature selection, and SVM was used as classifier. Bastos-Filho et al. [7] extracted three kinds of EEG signal features, used K-nearest neighbor (KNN) for emotion recognition, and classified calm and stress emotions in DEAP data. The highest accuracy was 70.1% based on power spectral density (PSD) feature. Yosi et al. [8] used wavelet function to extract features and combined multi-layer perceptron (MLP) to classify four emotions of EEG.
In addition, empirical mode decomposition (EMD) has been widely used in EEG signal processing. Zhuang et al. [9] decomposed EEG signals into IMFs by EMD. The first difference of IMFs sequence, the first difference of phase and the energy were extracted as features. The experimental results on DEAP data showed that the accuracy of arousal was 71.99%, and the accuracy of valence was 69.1%. Ozel and Akan [10] used multiple EMD to get the IMFs. Gram-Schmidt ortho-method was used to extract effective IMF, empirical wavelet transform was used to extract the features of effective IMF. Decision tree and ensemble classifiers are used to classify arousal, valence and dominance, and the classification performance is 72.7, 62.0 and 64.7%, respectively.
In recent years, with the development of artificial intelligence technology, emotion recognition of EEG signals is no longer limited to traditional machine learning methods. Neural networks have developed from the original MLP to deep learning, which is popular at present. Deep learning has been widely used to recognize emotion of EEG signals. Li et al. [11] proposed a hybrid deep learning model. The continuous wavelet transform and scale-map transform were used to preprocess the multi-channel EEG signals. Convolutional Neural Network (CNN) was used to extract features automatically. Long short-term memory (LSTM) was used for emotion recognition. Pandey and Seeja [12] proposed a VMD and deep neural network (DNN) EEG Signal Emotion Recognition method. Hwang et al. [13] considered local information within multiple channels or multiple frequency bands of EEG signals, and used CNN to evaluate the positive, neutral, and negative emotional states on the SEED dataset. Li et al. [14] introduced local and global attention mechanisms to highlight the transferable brain region samples, and proposed a transferable attention neural network. Three datasets of SEED, SEED-IV and MPED were used for verification. Wang et al. [15] used Transformer encoder to learn spatial information from different brain regions hierarchically and combined attention mechanisms to emphasize different contributions of brain regions. Cheng et al. [16] converted the data into 2D frame sequences and used deep forest for emotion recognition based on subject-dependent EEG.
Through literature analysis, we found that the classification research on EEG emotion mainly uses different classification methods to classify different emotional states. Most of them use DEAP datasets for emotion analysis, because it is typical EEG emotion dataset that is publicly available. Most of the existing studies use a single variable selection method, and the univariate filtering method only considers the correlation with the dimension of emotion, so the selected features may be too redundant. The forest with poor fitting quality in ensemble learning will have a negative impact on the overall performance of the model, thus reducing the accuracy of EEG recognition. To solve this issue, we propose a new EEG emotion recognition method. The main contributions of this study are as follows.
1) A compositional framework for EEG emotion recognition is proposed, which integrates VMD, Pearson correlation coefficient, AEN and CF.
2) A new feature selection method is proposed. The quadratic penalty term of the AEN is weighted. The minimum common redundancy maximum relevance criterion is used to calculate the weight of the penalty term, which takes into account not only the redundancy among variables, but also the relevance and redundancy between variables and targets.
3) An adaptive weighted CF algorithm is constructed. Classifiers with good performance are given higher weights to reduce the negative impact of classifiers with poor performance.
The dataset used in this paper are from a public multimodal database established by Sander Koelstra's team at the University of London [17]. DEAP is a classical dataset widely used in EEG emotion analysis. The EEG signals was collected from 32 healthy subjects. The experimental process was to make 32 subjects wear "10–20" international lead standard electrode cap. There were 40 electrode channels, including 32 channels of EEG signals. The remaining 8 channels generated other physiological signals such as electrooculography, electromyography. The electrode distribution of the EEG signal channel is shown in Figure 1, the sampling frequency was 512 HZ. Each subject watched 40 pieces of music videos. Each video was 1 minute long. 63s of EEG were collected each time, including 60s of watching the music video and 3s of baseline EEG without emotional stimulation before watching each video. The labels were generated by the subjects' subjective evaluation of arousal, valence, dominance and liking degree on a scale of 1–9 according to self-assessment manikin (SAM). In this study, 14 channels of EEG signals are used to experiment on the two dimensions of arousal and valence. The 14 channels used are AF3, F3, F7, FC5, T7, P7, O1, AF4, F4, F8, FC6, T8, P8 and O2, which are highlighted in red in Figure 1.
The proposed EEG emotion recognition framework mainly includes five stages: decomposition, extraction of effective components, feature extraction, variable selection and classification. The framework of emotion recognition of EEG is shown in Figure 2.
The process is as follows:
1) Data preprocessing. The data are downsampled. The EEG signals needed for the experiment were selected.
2) EEG signal decomposition. There seem to be few studies using VMD in sentiment analysis. Each preprocessed EEG signal sequence is decomposed by VMD to obtain K IMFs.
3) Effective component extraction. Since some components after decomposition have noise, the Pearson correlation coefficient between each IMF and the original signal is calculated. The components whose correlation value is greater than a certain threshold are selected as the effective components. The components whose correlation coefficient is less than the threshold are deleted.
4) Data segmentation. The IMF signals obtained in 3) are segmented with a sliding window of 1s in length, without overlap. The sliding step is 1s.
5) Feature extraction. PE, PSD, DE and Hjorth parameter characteristics are calculated for each segment data.
6) Variable selection. In order to reduce the complexity of the model, the dimension of extracted features is reduced. The proposed CRRAEN algorithm is used to select the feature variables and obtain the most valuable features.
7) EEG emotion classification. In the classification stage, the features after dimensionality reduction are used to train the WCF classifier. Model performance is evaluated using the evaluation metrics.
VMD [18] is a quasi-orthogonal and adaptive signal decomposition method. The main idea of VMD is to decompose the original signal into IMFs with different center frequencies and bandwidths by iteratively solving the optimal solution of the variational equation. VMD is suitable for processing nonlinear and non-stationary signals, which has obvious advantages over EMD [19], and effectively solves the mode aliasing problem of EMD.
The mathematical expression of the variational problem with constraints is as follows:
min{vk},{ωk}{∑k‖∂t[(δ(t)+jπt)∗vk(t)]e−jωkt‖22}s.t.∑kvk=f | (1) |
f is original input signal, j is an imaginary unit, ∂t is the partial derivative of t, ∗ represents the convolution, δ(t) is the impact signal, vk:={v1,⋯,vK} represents IMFs after VMD decomposition, K represents the number of IMFs, {ωk}:={ω1,⋯,ωK} represents the center frequency of each component. ∑k:=∑Kk=1 represents the sum of all modes.
The quadratic penalty factor α and Lagrange multiplication operator λ(t) are introduced into Eq (1) to convert the constrained problem into an unconstrained problem. α is the balance parameter of data fidelity constraint, then the function expression is:
L({vk},{ωk},λ)=α∑k‖∂t[(δ(t)+jπt)∗vk(t)]e−jωkt‖22+‖f(t)−∑kvk(t)‖22+⟨λ(t),f(t)−∑kvk(t)⟩ | (2) |
In order to obtain the optimal solution of Eq (2), the alternating direction multiplier method is adopted to continuously update vn+1k, ωn+1k, λn+1k iteratively to obtain the "saddle point" of Eq (2), where the expression of vn+1k is:
vn+1k=argmin{α‖∂t[(δ(t)+jπt)∗vk(t)]e−jωkt‖22+‖f(t)−∑ivi(t)+λ(t)2‖22} | (3) |
where ωk=ωn+1k, ∑ivi(t)=∑i≠kvi(t)n+1.
Fourier transform is applied to Eq (3) to obtain the frequency domain expressions of IMF and center frequency, respectively.
ˆvn+1k(ω)=ˆf(ω)−∑i≠kˆvi(ω)+ˆλ(ω)21+2α(ω−ωk)2 | (4) |
ωn+1k=∫∞0ω|ˆvk(ω)|2dω∫∞0|ˆvk(ω)|2dω | (5) |
ˆvn+1k is Wiener filtering of ˆf(ω)−∑i≠kˆvi(ω), ωn+1k is center frequency; n is the number of iterations, ˆvk(ω) denotes the inverse Fourier transform. The real part of the result is vk(t).
EEG signals have obvious nonlinear non-stationary characteristics. VMD is beneficial to reduce the non-stationarity of EEG signals.
EEG signal is a kind of physiological signal with strong randomness and high time-varying sensitivity. When the EEG signal is directly input into the classifier, the classifier cannot get the ideal result. So feature extraction is necessary. This paper mainly extracts the PSD, permutation entropy (PE), differential entropy (DE) and Hjorth parameter features of EEG signals.
1) PSD
PSD represents the signal power in the unit frequency band of the input signal, which is utilized to measure the mean square value of random variables. The simplest method to solve the PSD is the periodogram method. Because the periodogram method is biased estimation, the obtained power spectrum is not smooth. The Welch method is used in this paper to calculate the PSD [20].
The sequence data x(t) is divided into L segments. Each segment has M data. The window function w(t) operates on each segment of data. The power spectral of the ithsegment is calculated.
Pi(ω)=1U|M−1∑t=0xi(t)w(t)e−jωt|2 | (6) |
U=1MM−1∑t=0w2(t), the PSD is as follows:
P(xi)=1LL∑i=1Pi(ω) | (7) |
2) PE
PE is a simple and fast nonlinear method to quantitatively analyze the complexity of sequences. It can detect the real-time dynamic characteristics of sequences and has strong anti-noise ability [21].
Suppose there is a set of time series X={x1,x2,⋯,xn}. According to the reconstruction theory, the phase space of element xi in X is reconstructed. The obtained reconstruction matrix is Xi={xi,xi+1,⋯,xi+(m−1)τ}. m is the embedding dimension and τ represents the delay time.
Each sequence in the reconstructed vector is rearranged in ascending order, resulting in a new vector consisting of the original position coordinates of the elements in the vector.
xi+(j1−1)τ⩽xi+(j2−1)τ⩽⋯⩽xi+(jm−1)τ | (8) |
j1,j2,⋯,jm is the position of each element in Xi.
For any type of positional order, there are m! possible permutations. The frequency p1,p2,⋯,pk of occurrence of the number of each permutation corresponding to the k reconstructed components in the whole permutation is calculated as the probability, where k⩽m!. Calculate the information entropy of all types of probabilities.
H=−k∑i=1pilogpi | (9) |
The PE of time series is defined as:
PE=Hlnm! | (10) |
PE measures the randomness of time series. The larger the PE, the more irregular the time series.
3) DE
DE is used to measure the complexity of continuous variables. Previous studies have shown that DE can provide relatively stable EEG features [22,23]. DE is calculated as follows.
DE=−∫∞−∞1√2πσ2e−(x−μ)22σ2log(1√2πσ2e−(x−μ)22σ2)dx=12log(2πeσ2) | (11) |
The distribution of EEG signal sequence approximately follows the Gaussian distribution of N(μ,σ2). According to the formula (11), the DE is equal to the logarithm of the energy spectrum in a certain frequency band.
4) Hjorth parameter
Hjorth parameter [24] is widely used in EEG signal analysis [25,26] due to its low computational cost. It includes three parameters: Activity, Mobility and Complexity.
Activity represents the amplitude characteristics of the EEG, which is defined as the variance of the signal.
Ac=σ2 | (12) |
Mobility represents the time scale of the EEG signal. It is defined as the ratio of the standard deviation of the first difference to the standard deviation of the EEG signal.
Mo=σ′σ | (13) |
Complexity is used to characterize the complexity of EEG signals. It is defined as the ratio of the mobility of the first derivative of the EEG signal to the mobility of the vibration signal.
Co=σ″σ′/σ″σ′σ′σσ′σ | (14) |
σ″ is the standard deviation of the second-order difference of the original vibration signals.
The coefficient of AEN [27] is estimated as
ˆβaen=argminβ{‖Y−p∑i=1Xiβi‖2+λ2p∑i=1β2i+λ1p∑i=1ˆwi|βi|} | (15) |
ˆwi is the adaptive penalty weight, ˆwi=|ˆβi,en|−γ, ˆβi,en is the coefficient estimate of the elastic net, γ>0 is constant.
The minimum redundancy maximum relevance (mRMR) criterion proposed by Ding and Peng [28] uses mutual information (MI) as a metric criterion, and aims to find out M features that have the largest relevance with the target variable and the minimum mutual redundancy from the feature set space.
max[1|S|∑xj∈SI(xj,c)−1|S|2∑xj,xi∈SI(xj,xi)] | (16) |
where, S and |S| represent the set of features and the number of features, respectively. c is the target variable. I(xj,c) represents the MI between variable xj and target variable c. I(xj,xi) represents the MI between variable xj and variable xi.
The redundant part of mRMR only considers the MI between features, but ignores the common mutual information (CMI) between features and target variables. The estimation of CMI is constructed by Bennasar's "maximum of the minimum" method [29]. Based on AEN and mRMR, a new variable selection method called AEN based on minimum common redundancy maximum relevance, abbreviated as CRRAEN, is proposed. Weight is given to the quadratic penalty term of the AEN. At the same time, the minimum common redundancy maximum relevance (mCRMR) criterion is used to calculate the weight matrix.
The maximum common redundant MI can be expressed as follows. Assume that X is the candidate variable and Y is the selected variable. The CMI is determined by the redundancy information rate of the candidate variable and the MI between the selected variable, the candidate variable and the target variable. CMI is expressed as
CMI(Xk,S,c)=maxXj∈S{I(Xk;Xj)max{I(Xk;Xj),I(Xk;c),I(Xj;c)}×min{I(Xk;c),I(Xj;c)}} | (17) |
The Eq (17) represents the redundancy of candidate variable Xk and selected variable subset S with respect to c. According to the minimum common redundancy maximum relevance criterion, the MI can be expressed as
f(Xk)=I(Xk,c)−CMI(Xk,S,c) | (18) |
For n×p-dimensional data, the importance of defining the kthvariable is expressed as:
Ik=f(Xk) | (19) |
The weight coefficient of the kthvariable is
wk={1Ik,Ik>η1η,otherwise | (20) |
where, 0<η⩽1 indicates the given threshold. When Ik>η, the kthvariable has obvious significance. When Ik⩽η, the kthvariable is not significant in predicting the explanatory variable. The weight matrix is expressed as:
W=diag(w1,⋯,wp) | (21) |
The penalty term of the AEN is reconstructed. The weight is calculated according to (21).
(1−α)‖√Wβ‖2+αp∑j=1wj|βj| | (22) |
The CRRAEN adaptive variable selection model is expressed as
L(λ,α,β)=‖y−Xβ‖2+λ((1−α)‖√Wβ‖2+αp∑j=1wj|βj|) | (23) |
‖√Wβ‖2=p∑j=1wjβ2j | (24) |
0⩽α⩽1, λ is the penalty parameter, and the estimated value can be expressed as
ˆβ=argminβ{‖y−Xβ‖2+λ((1−α)‖√Wβ‖2+αp∑j=1wj|βj|)} | (25) |
MCRMR is used to remove the common redundant information in the affective features. The purer emotional feature information is incorporated into the calculation of the weight of the penalty term, which is more conducive to get the real coefficient estimate. The model considers the global normalization of sentiment dimension, avoids the complex calculation of MI and redundant estimation, and realizes the control of role of redundant variables.
CF is an ensemble algorithm proposed by Zhou and Feng [30] in 2017. Inspired by DNN's layer-by-layer feature representation learning, a CF is established. Each cascade is composed of multiple layers. Generally, each layer is composed of two random forests (RF) and two completely random forests (CRF), respectively. The data characteristics are processed layer by layer through the CF stage, which strengthens the representation learning ability of the algorithm and is conducive to improving the accuracy of prediction. In the stage of the CF, in addition to the first layer with original features as input data. Each layer from the previous layer to obtain the processed characteristic information. The processing result output of this layer is passed to the next layer. Each subsequent layer concatenates the output result of the previous layer with the original input feature vector as the input of this layer. The CF structure is shown in Figure 3.
The difference of RF generalization ability is not considered in the CF. Forests with poor generalization ability will have a negative impact on the overall performance of the model. In order to alleviate this problem, we propose a Weighted Cascade Forest (WCF). Each learner at each level is given a weight. The weight β is assigned to each classifier according to its classification performance, as shown in Figure 4. The area AUC (Aera Under Curve) under the Receiver Operating Characteristic (ROC) curve is used to calculate the weight. The weight calculation process is as follows.
1) For the first level, the True Positive Rate (TPR) and False Positive Rate (FPR) are calculated based on the classification results of each classifier.
TPR=tptp+fnFPR=fpfp+tn | (26) |
TPR represents the proportion of positive samples with correct prediction to the actual positive samples. FPR represents the proportion of positive samples with incorrect prediction to the actual negative samples. Where, tp and tn are the number of true positive and true negative, respectively. fp and fn are the number of false positive and false negative, respectively.
2) With TPR as the vertical axis and FPR as the horizontal axis, the ROC curve is drawn along the lines of the tracing points. The area under the ROC curve is calculated to obtain the AUC. Then the weight of each classifier is
βi=AUCiI∑i=1AUCi | (27) |
where I represents the number of learners in each layer.
3) The class probability vector output by each classifier is multiplied with the corresponding βi, and then concatenated with the original input as the input of the next level. The weights of other classifiers are calculated using the same method.
4) Finally, the class with the greatest probability is selected as the classification result.
The number of cascaded forest layers is adaptive by the algorithm. If adding a cascade layer does not improve the performance of the model, the training process is terminated.
This experiment studies the subject-independent sentiment analysis. For partitioning the data set, 10-fold cross-validation is used to evaluate the performance. The metrics are as follows.
recall=tptp+fnprecision=tptp+fpaccuracy=tp+tntp+fn+fp+tn | (28) |
Parameters settings: PE takes two parameters: m and τ. If m is selected too large, the computational complexity will increase; If m is too small, the permutation space will be small. In many applications, m = 3 to 7 are recommended. We set m to be 4 and τ to be 1. VMD has two parameters: K and α. In order to avoid the arbitrariness of manual setting, Ensemble Empirical Mode Decomposition (EEMD) [31] is used for reference. EEMD is an adaptive signal decomposition method using a noise-assisted program to alleviate the mode mixing problem in EMD by repeatedly adding white Gaussian noise. K is determined by the model itself and does not need to be manually set. After the EEG signal is decomposed by EEMD, 12 IMFs can be obtained adaptively. The value of K for VMD is set to 12, the same as that for EEMD. The default value for α is 2000. For the weighted cascade classifier, the number of trees and forests on the arousal dimension is set to 250 and 5, respectively. There are three RF and two CRF. On the valence dimension, the number of trees and forests is 150 and 5, respectively. There are two RF and three CRF, respectively. The Gini coefficient is used to select the partition attributes. The minimum sample number of leaf nodes is set to 1.
In this paper, two experiments are set to classify high/low arousal and high/low valence respectively. The original label is a score from 1 to 9. The score less than 5 is set as the low dimension, and the score greater than or equal to 5 is set as the high dimension. The class distribution is shown in Figure 5. First, the sample is downsampled to 128 HZ, and a bandpass filter from 4 HZ to 45 HZ is utilized to remove the ocular artifacts. Since the subjects did not watch the video for the first 3s, they could not show emotion. At the beginning and end of watching the video, subjects will have a process of entering the state and coming out of the state. Their emotions are mainly concentrated in the middle segment of the video. Therefore, the EEG signals in the first 13s and the last 10s are removed, and only the data in the middle 40s are retained for analysis. After data preprocessing, the dimension of EEG data of each subject is 40 × 14 × 5120 (video × channel × sample). Among them, 5120 is the number of sampling points within 40s at the sampling frequency of 128 HZ.
For the purpose of results presentation, the EEG of a single subject is illustrated. Figure 6 shows the EEG signal of AF3 channel of music video watched by a single subject. After VMD decomposition, a set of IMFs with vibration around different center frequencies are obtained, as shown in Figure 7. As can be seen from the figure, IMFs is arranged from low frequency to high frequency.
The high frequency components after VMD decomposition usually contain noise. In order to eliminate the interference of noise, pearson correlation coefficient method is used to calculate the correlation between each IMF and the original signal. The larger the absolute value of the correlation coefficient, the stronger the correlation. In this paper, the IMFs whose absolute value of correlation coefficient are greater than 0.35 is retained. Figure 8 shows the correlation coefficient between each IMF and the original signal. By comparison, IMF1–IMF5 is finally retained.
The features of all IMF1–IMF5 are extracted by sliding window. 16,800 dimensional features are obtained. In order to reduce the time cost, the CRRAEN is used for variable selection, and the 168-dimensional features obtained after variable selection are input into the WCF for classification. Figure 9 shows the confusion matrix obtained by the proposed method of valence and arousal emotion recognition. The element on the positive diagonal represents the accuracy of the correct classification of the sample. The confusion matrix shows that high arousal has higher recognition accuracy than low arousal. The recognition accuracy of high valence and low valence have similar recognition accuracy.
In terms of accuracy, precision and recall, the proposed method is compared with CF, WCF, RFVS-CF, mRMR-CF, EN-CF, AEN-CF and CRRAEN-CF methods. All methods are based on VMD decomposition and feature extraction. CF and WCF indicates that all features are used for classification. Other methods use different variable selection methods combined with CF. RFVS represents RF variable selection method.
Table 1 shows the results of the proposed method on the arousal dimension. For CF and WCF methods, the accuracy, precision and recall of WCF are 3.05, 2.55 and 1.83% higher than CF, respectively. It shows that the WCF is helpful to improve the accuracy. The accuracy of the proposed CRRAEN-WCF is 4.3, 8.44, 9.69, 10.79 and 12.43 higher than CRRAEN-CF, RFVS-CF, mRMR-CF, AEN-CF, EN-CF, respectively, and higher than that of CF and WCF. The precision of the proposed CRRAEN-WCF is 76.14% higher than other methods. The recall of the proposed method is 78.88% better than CRRAEN-CF, AEN-CF, EN-CF, mRMR-CF, RFVS-CF. The above results demonstrate the effectiveness of the proposed method.
Method | Accuracy | Precision | Recall |
CF | 0.6375 | 0.6412 | 0.7277 |
WCF | 0.6680 | 0.6667 | 0.7460 |
RFVS-CF | 0.6633 | 0.6715 | 0.7354 |
mRMR-CF | 0.6508 | 0.6530 | 0.7347 |
EN-CF | 0.6234 | 0.6335 | 0.7151 |
AEN-CF | 0.6398 | 0.6516 | 0.7201 |
CRRAEN-CF | 0.7047 | 0.7192 | 0.7571 |
CRRAEN-WCF | 0.7477 | 0.7614 | 0.7888 |
Table 2 shows the results of the proposed method on the valence dimension. The accuracy of the proposed CRRAEN-WCF is 3.67, 4.22, 7.19, 6.25, 4.46, 4.06 and 6.17% higher than CRRAEN-CF, AEN-CF, EN-CF, mRMR-CF, RFVS-CF, WCF, CF, respectively. The precision of the proposed method is 83.82% higher than CRRAEN-CF, AEN-CF, EN-CF, mRMR-CF, RFVS-CF. The recall of CRRAEN-WCF is 82.50% better than other comparison methods.
Method | Accuracy | Precision | Recall |
CF | 0.7477 | 0.7543 | 0.7795 |
WCF | 0.7688 | 0.7784 | 0.7956 |
RFVS-CF | 0.7648 | 0.7862 | 0.7879 |
mRMR-CF | 0.7469 | 0.7735 | 0.7702 |
EN-CF | 0.7375 | 0.7576 | 0.7650 |
AEN-CF | 0.7672 | 0.7847 | 0.7913 |
CRRAEN-CF | 0.7727 | 0.8325 | 0.7820 |
CRRAEN-WCF | 0.8094 | 0.8382 | 0.8250 |
The experimental results of arousal and valence show that the proposed CRRAEN feature selection method has more obvious advantages than RFVS, mRMR, EN and AEN. CRRAEN takes full account of the redundant information of variables, and combines the advantages of AEN, which excludes irrelevant variables. CRRAEN can significantly improve the accuracy of emotion recognition. Regardless of variable selection, WCF has a higher accuracy of emotion recognition than CF, which also confirms the effectiveness of weighting.
To evaluate the classification performance of the proposed approach from a statistical perspective, we perform Friedman test with 90% confidence on the performance of the different algorithms. It is commonly used in medical statistical analysis. Figure 10 is the Friedman test chart drawn according to the ranking results of the accuracy values of each algorithm. The vertical axis represents the algorithm, and the horizontal axis represents the average order value. If the horizontal lines of the two algorithms overlap, it means that there is no significant difference between the two algorithms; otherwise, there is a significant difference. It is obvious from Figure 10(b) that the horizontal line segments of the proposed CRRAEN-CF, AEN-CF, EN-CF, MRMR-CF, RFVS-CF, WCF and CF algorithms do not overlap, respectively. For valence, CRRAEN-WCF is significantly different from the comparison algorithm. In Figure 10(a), except for CRRAEN-CF, the horizontal segments of CRRAEN-WCF and AEN-CF, EN-CF, mRMR-CF, RFVS-CF, WCF and CF do not overlap, respectively. It demonstrates that CRRAEN-WCF is significantly different from other algorithms. However, there is overlap with the horizontal line segment of CRRAEN-CF, which is caused by the low accuracy of the proposed method in low arousal.
Table 3 lists the performance of emotion recognition methods using the same DEAP dataset in the existing literature. The items listed in the table are related to whether the methods in the existing literature use decomposition methods, which features are extracted, which feature selection method is used, which classifier is used and the number of classes. The accuracy of our proposed method is higher than that of some classical classification methods, such as SVM [3,32,36,38], RF [35] and KNN [37]. Among them, Atkinson and Campos [36] achieved relatively high performance by using MRMR reduction and GA-SVM classification, thanks to the optimization of SVM parameters by genetic algorithm. Compared with CWT-CRNN [11], VMD-DNN [12], DBN [33], PCA-SAE [34] and Transformer [15] in the deep model, the proposed method achieves higher accuracy. Among them, reference [11] combined CNN and LSTM. The accuracy of valence is 72.06%, and the accuracy of arousal is 74.12%, which is higher than other deep models. Literature [39] used MRMR variable selection combined with CF to classify five emotions, and the accuracy is 51.3%. The variable selection method proposed in this paper considers the common redundant information to construct mCRMR, and combines the embedded variable selection method AEN, which reduces the feature redundancy compared with MRMR. In addition, weight is given to the classifiers with good performance in the CF, which improves the role of strong classifiers compared with CF. The above results show that the proposed VMD-CRRAEN-WCF method has certain competitiveness.
Reference | Feature | Feature selection | Classifier | Number of classes | accuracy |
Koelstra et al. (2010) [32] | PSD | SVM | 3 | valence: 58.8 arousal: 55.7 | |
Wang and Shang (2013) [33] | DBN | 2 | valence: 51.2 arousal: 60.9 | ||
Jirayucharoensak et al. (2014) [34] | PS, difference between PS | SAE | 3 | valence: 49.52 arousal: 46.03 | |
PS, difference between PS | PCA | SAE | 3 | valence: 52.53 arousal: 49.17 | |
Ackermann et al. (2016) [35] | statistics, STFT, Higher Order Crossing, Hilbert-Huang Spectrum | MRMR | RF | 3 | anger, surprise and other: 55.23 |
Atkinson and Campos (2016) [36] | statistics, band power (BP), Hjorth parameters (HP), fractal dimension (FD) | MRMR | GA-SVM | 2 | valence: 73.14 arousal: 73.06 |
Li et al. (2017) [11] |
CWT, scale-map transform | CRNN | 2 | valence: 72.06 arousal: 74.12 |
|
Mert and Akan (2018) [37] |
MEMD-PSD, HP, entropy | ICA | KNN | 2 |
valence: 67 arousal: 51.01 |
Tiwari and Falk (2019) [3] | Benchmark Features, Motif-Based Features |
ANOVA | SVM | 2 | valence: 59.3 arousal: 54.46 |
Benchmark Features, Motif-Based Features |
MRMR | SVM | 2 | valence: 58.16 arousal: 56.45 | |
Benchmark Features, Motif-Based Features |
RFE | SVM | 2 | valence: 60.10 arousal: 55.98 |
|
Yin et al. (2020) [38] |
Temporal, frequency, statistics | LRFS | LSSVM | 2 | valence: 70.9 arousal: 67.43 |
Fang et al. (2021) [39] | PSD, DE | MRMR | CF | 5 | angry, happy, sad, pleasant, and neutral: 51.3 |
Pandey and Seeja (2022) [12] | VMD-PSD, First Difference of IMF | DNN | 2 | valence: 62.5 arousal: 61.25 | |
EMD-PSD, First Difference of IMF | DNN | 2 | valence: 56 arousal: 60 | ||
Wang et al. (2022) [15] | Transformer | 2 | valence: 66.63 arousal: 66.2 |
||
Proposed method | VMD-PSD, PE, DE, Hjorth parameters | CRRAEN | WCF | 2 | valence: 80.94 arousal: 74.77 |
Confirmation of hyper-parameters of WCF: We investigate the change of classification performance with the number of RF and decision tree. For the arousal and valence classification tasks, we use line graphs to visualize the trend of accuracy with the number of trees. When we study the changing trend of accuracy as the number of trees, the other parameters remain unchanged, the number of RF and CRF set to 2. The change interval of tree is 50. According to Figure 11, at the beginning, the accuracy improves gradually with the increase of the number of trees; when the tree reaches a certain amount, accuracy reaches maximum; when the number of trees continues to increase, the accuracy of the model fluctuates in a small range. The accuracy of WCF is highest when the number of trees reached 250 on arousal dimension, after which there is a slight fluctuation. When the number of trees in WCF is 150 on valence dimension, the accuracy is higher. From the perspective of model complexity, the number of trees is set as 150.
When we observed the effect of the number of forests on the accuracy, the other parameters are fixed. We use the confusion matrix to visualize, where the horizontal axis represents the number of CRF, the vertical axis represents the number of RF, and the label value represents the accuracy. It can be seen that with the increase of forest, the accuracy rate does not show a certain regularity. When 2 CRF and 3 RF in arousal, the accuracy value of arousal was the highest. When there were 3 CRF and 2 RF, the best accuracy rate was achieved on the valence dimension. In the arousal dimension, the optimal accuracy was obtained when two CRF and three RF were selected.
Running time:The configuration used in the experiment is Intel i7-10510U CPU (4 cores), and the memory is DDR4 2666 MHz 16G. Python3.9 version is used for training. The computing cost is shown in Table 4. The calculational cost of the algorithm without dimension reduction is relatively large, and the time cost of the algorithm after variable selection is significantly reduced. The proposed method in this paper combines two variable selection methods, and the classification time is higher than other single variable selection methods, but the accuracy is significantly improved.
Method | Time(s) |
CF | 1897.95 |
WCF | 1698.97 |
RFVS-CF | 259.90 |
mRMR-CF | 208.65 |
EN-CF | 259.51 |
AEN-CF | 224.36 |
CRRAEN-CF | 496.52 |
CRRAEN-WCF | 474.68 |
This paper mainly studies and analyzes the emotion recognition algorithm based on the EEG signal. In this paper, a new hybrid framework based on VMD-CRRAEN-WCF is proposed to improve the accuracy of EEG emotion recognition. Firstly, VMD is used to decompose the EEG signal and select the most relevant IMF with the original signal. CRRAEN is used to select features that are significantly correlated with the target variables. Finally, a weighted CF classifier is used for emotion recognition. Compared with EMD, VMD is a non-recursive signal analysis method, which can effectively resist noise interference and reduce the mode aliasing problem. CRRAEN is a new feature selection method, which combines the AEN and the minimum common redundancy maximum relevance criterion. CRRAEN fully considers the common redundant MI, and can effectively select related variables and reduce redundant. CF is an ensemble learning algorithm, which does not rely on large samples and precise tuning of hyper-parameters. The weighted CF mainly considers the different contributions of the forest learner, and has achieved satisfactory results on the public DEAP dataset.
Compared with some existing research methods, the proposed method has higher accuracy, which indicates the effectiveness of the proposed algorithm framework. The framework can also be applied to emotion recognition of other physiological signals.
This work was supported by the Science and Technology Department Project of Jilin Province (Grant No. 20210101149JC), the Science and Technology Department Project of Jilin Province (Grant No. 20200403182SF), and the Education Department Project of Jilin Province (Grant No. JJKH20210625KJ). The authors are very grateful for the supports.
The authors declare there is no conflict of interest.
[1] |
D. Razavi-Shearer, I. Gamkrelidze, M. H. Nguyen, D. S. Chen, P. Van Damme, Z. Abbas, et al., Global prevalence, treatment, and prevention of hepatitis B virus infection in 2016: a modelling study, Lancet Gastroenterol., 3 (2018), 383–403. https://doi.org/10.1016/S2468-1253(18)30056-6 doi: 10.1016/S2468-1253(18)30056-6
![]() |
[2] |
N. Nelson, P. Easterbrook, B. Mcmahon, Epidemiology of hepatitis b virus infection and impact of vaccination on disease, Clin. Liver Dis., 20 (2016), 607–628. https://doi.org/10.1016/j.cld.2016.06.006 doi: 10.1016/j.cld.2016.06.006
![]() |
[3] | WHO, Hepatitis B fact sheets, Available from: http://www.who.int/news-room/fact-sheets/detail/hepatitis-b. (accessed July 4, 2023) |
[4] |
J. Shen, N. Zhang, Change-point model on nonhomogeneous poisson processes with application in copy number profiling by next-generation DNA sequencing, Ann. Appl. Stat., 6 (2012), 476–496. https://doi.org/10.1214/11-AOAS517 doi: 10.1214/11-AOAS517
![]() |
[5] |
V. Muggeo, G. Adelfio, Efficient change point detection for genomic sequences of continuous measurements, Bioinformatics, 2 (2011), 27. https://doi.org/10.1093/bioinformatics/btq647 doi: 10.1093/bioinformatics/btq647
![]() |
[6] | G. Teyssire, A. Kirman, Long memory in economics, Springer Berlin Heidelberg, 2007. https://doi.org/10.1007/978-3-540-34625-8 |
[7] |
X. Cao, J. Wang, J. Liao, Z. Gao, D. Jiang, J. Sun, et al., EBacterioplankton community responses to key environmental variables in plateau freshwater lake ecosystems: A structural equation modeling and change point analysis, Sci. Total Environ., 580 (2017), 457–467. https://doi.org/10.1016/j.scitotenv.2016.11.143 doi: 10.1016/j.scitotenv.2016.11.143
![]() |
[8] | K. Shi, Y. Touge, Identifying the shift in global wildfire weather conditions over the past four decades: An analysis based on change-points and long-term trends, Geosci. Lett., 10 (2023). https://doi.org/10.1186/s40562-022-00255-6 |
[9] | G. Texier, M. Farouh, L. Pellegrin, M. Jackson, J. Meynard, X. Deparis, et al., Outbreak definition by change point analysis: A tool for public health decision?, BMC Med. Inform. Decis. Mak., 16 (2016). https://doi.org/10.1186/s12911-016-0271-x |
[10] |
X. Chen, H. Wang, W. Lyu, R. Xu, The mann-kendall-sneyers test to identify the change points of covid-19 time series in the united states, BMC Med. Res. Methodol., 22 (2022), 1–9. https://doi.org/10.1186/s12874-022-01714-6 doi: 10.1186/s12874-022-01714-6
![]() |
[11] |
V. Punyapornwithaya, Lumpy skin disease outbreaks in africa, europe, and asia (2005–2022): Multiple change point analysis and time series forecast, Viruses, 14 (2022), 2203. https://doi.org/10.3390/v14102203 doi: 10.3390/v14102203
![]() |
[12] |
H. Mann, Nonparametric tests against trend, Econometrica, 13 (1945), 245–259. https://doi.org/10.2307/1907187 doi: 10.2307/1907187
![]() |
[13] | M. Kendall, Rank Correlation Methods, Br. J. Psychol., 25 (1990), 86–91. |
[14] | R. Sneyers, On the Statistical Analysis of Series of Observations, J. Biol. Chem., 258 (1990), 13680–13684. |
[15] |
Q. Zhang, C. Xu, S. Becker, T. Jiang, Sediment and runoff changes in the Yangtze River basin during past 50 years, J. Hydrol., 331 (2006), 511–523. https://doi.org/10.1016/j.jhydrol.2006.05.036 doi: 10.1016/j.jhydrol.2006.05.036
![]() |
[16] | M. Kendall M, A. Stuart, The advanced theory of statistics. Vol.3: Design and analysis, and time-series, J. R. Stat. Soc., 147 (1983). https://doi.org/10.2307/2988093 |
[17] | D. R. Helsel, R. M. Hirsch, K. R. Ryberg, S. A. Archfield, E. J. Gilroy, Statistical Methods in Water Resources, U.S. Geological Survey Techniques and Methods, book 4, chap. A3,458p, 2020. https://doi.org/10.3133/tm4a3 |
[18] | C. Fu, Q. Wang, The Definition and Detection of the Abrupt Climatic Change, Chin. J. Atmos. Sci., 04 (1992), 482–493. |
[19] | J. Searcy, C. Hardison, Double mass curves. Geological Survey Water Supply Paper 1541-B, U.S. Geological Survey, Washington., 1960. |
[20] | S. Wglarczyk, On the stationarity of extreme levels of some Polish lakes. I. Preliminary results from statistical test, Limnol. Rev., 9 (2009), 129–138. |
[21] |
H. Cho, P. Fryzlewicz, Multiple-change-point detection for high dimensional time series via sparsified binary segmentation, J. R. Stat. Soc., 77 (2015), 475–507. https://doi.org/10.1111/rssb.12079 doi: 10.1111/rssb.12079
![]() |
[22] | V. Muggeo, G. Sottile, M. Porcu, Modelling COVID-19 outbreak: Segmented regression to assess lockdown effectiveness, Research Gate, 10 (2020). |
[23] |
R. Killick R, P. Fearnhead, I. Eckley, Optimal detection of changepoints with a linear computational cost, J. Am. Stat. Assoc., 107 (2012), 1590–1598. https://doi.org/10.1080/01621459.2012.737745 doi: 10.1080/01621459.2012.737745
![]() |
[24] |
R. Killick, I. Eckley, Changepoint: An r package for changepoint analysis, J. Stat. Softw., 58 (2015), 1–19. https://doi.org/10.18637/jss.v058.i03 doi: 10.18637/jss.v058.i03
![]() |
[25] | V. Muggeo, Segmented: An R Package to Fit Regression Models With Broken-Line Relationships, Rnews., 8 (2008), 20–25. |
[26] | V. Muggeo, Regression Models with Breakpoints/Changepoints Estimation, 2015. |
[27] |
A. Hardi, K. Kawai, S. Lee, K. Maekawa, Change Point Analysis of Exchange Rates Using Bootstrapping Methods: An Application to the Indonesian Rupiah 2000–2008, Asia-Pacific Financ. Mark., 22 (2015), 429–444. https://doi.org/10.1007/s10690-015-9206-2 doi: 10.1007/s10690-015-9206-2
![]() |
[28] |
J. Lin, J. Chen, Y. Li, Bayesian Analysis of Student t Linear Regression with Unknown Change-Point and Application to Stock Data Analysis, Comput. Econ., 40 (2012), 203–217. https://doi.org/10.1007/s10614-011-9305-8 doi: 10.1007/s10614-011-9305-8
![]() |
[29] |
O. Seidou, J. Asselin, T. Ouarda, Bayesian multivariate linear regression with application to change point models in hydrometeorological variables, Water Resour. Res., 43 (2007), 387–390. https://doi.org/10.1029/2005WR004835 doi: 10.1029/2005WR004835
![]() |
[30] |
H. Zhang, J. Stephen, J. Carter, Change Point Detection and Trend Analysis for Time Series, Chin. J. Chem. Phys., 35 (2022), 399–406. https://doi.org/10.1063/1674-0068/cjcp2112266 doi: 10.1063/1674-0068/cjcp2112266
![]() |
[31] |
X. Wu, X. Zhou, Y. Chen, K. Zhai, R. Sun, G. Luo, et al., The Impact of COVID-19 Lockdown on Cases of and Deaths From AIDS, Gonorrhea, Syphilis, Hepatitis B, and Hepatitis C: Interrupted Time Series Analysis, JMIR Public Health Surveill., 9 (2023), 399–406. https://doi.org/10.2196/40591 doi: 10.2196/40591
![]() |
[32] |
M. Xiridou, P. Adam, A. Meiberg, M. Visser, A. Matser, J.de Wit, et al., The impact of the COVID-19 pandemic on hepatitis B virus vaccination and transmission among men who have sex with men: A mathematical modelling study, Vaccine., 40 (2022), 4889–4896. https://doi.org/10.1016/j.vaccine.2022.06.075 doi: 10.1016/j.vaccine.2022.06.075
![]() |
1. | Tianhui Sha, Yikai Zhang, Yong Peng, Wanzeng Kong, Semi-supervised regression with adaptive graph learning for EEG-based emotion recognition, 2023, 20, 1551-0018, 11379, 10.3934/mbe.2023505 | |
2. | Juan A. Ramirez-Quintana, Felipe E. Garay Acuña, Mario I. Chacon-Murguia, Alejandro A. Torres-García, Alma D. Corral-Saenz, 2025, Chapter 9, 978-3-031-75542-2, 113, 10.1007/978-3-031-75543-9_9 |
Method | Accuracy | Precision | Recall |
CF | 0.6375 | 0.6412 | 0.7277 |
WCF | 0.6680 | 0.6667 | 0.7460 |
RFVS-CF | 0.6633 | 0.6715 | 0.7354 |
mRMR-CF | 0.6508 | 0.6530 | 0.7347 |
EN-CF | 0.6234 | 0.6335 | 0.7151 |
AEN-CF | 0.6398 | 0.6516 | 0.7201 |
CRRAEN-CF | 0.7047 | 0.7192 | 0.7571 |
CRRAEN-WCF | 0.7477 | 0.7614 | 0.7888 |
Method | Accuracy | Precision | Recall |
CF | 0.7477 | 0.7543 | 0.7795 |
WCF | 0.7688 | 0.7784 | 0.7956 |
RFVS-CF | 0.7648 | 0.7862 | 0.7879 |
mRMR-CF | 0.7469 | 0.7735 | 0.7702 |
EN-CF | 0.7375 | 0.7576 | 0.7650 |
AEN-CF | 0.7672 | 0.7847 | 0.7913 |
CRRAEN-CF | 0.7727 | 0.8325 | 0.7820 |
CRRAEN-WCF | 0.8094 | 0.8382 | 0.8250 |
Reference | Feature | Feature selection | Classifier | Number of classes | accuracy |
Koelstra et al. (2010) [32] | PSD | SVM | 3 | valence: 58.8 arousal: 55.7 | |
Wang and Shang (2013) [33] | DBN | 2 | valence: 51.2 arousal: 60.9 | ||
Jirayucharoensak et al. (2014) [34] | PS, difference between PS | SAE | 3 | valence: 49.52 arousal: 46.03 | |
PS, difference between PS | PCA | SAE | 3 | valence: 52.53 arousal: 49.17 | |
Ackermann et al. (2016) [35] | statistics, STFT, Higher Order Crossing, Hilbert-Huang Spectrum | MRMR | RF | 3 | anger, surprise and other: 55.23 |
Atkinson and Campos (2016) [36] | statistics, band power (BP), Hjorth parameters (HP), fractal dimension (FD) | MRMR | GA-SVM | 2 | valence: 73.14 arousal: 73.06 |
Li et al. (2017) [11] |
CWT, scale-map transform | CRNN | 2 | valence: 72.06 arousal: 74.12 |
|
Mert and Akan (2018) [37] |
MEMD-PSD, HP, entropy | ICA | KNN | 2 |
valence: 67 arousal: 51.01 |
Tiwari and Falk (2019) [3] | Benchmark Features, Motif-Based Features |
ANOVA | SVM | 2 | valence: 59.3 arousal: 54.46 |
Benchmark Features, Motif-Based Features |
MRMR | SVM | 2 | valence: 58.16 arousal: 56.45 | |
Benchmark Features, Motif-Based Features |
RFE | SVM | 2 | valence: 60.10 arousal: 55.98 |
|
Yin et al. (2020) [38] |
Temporal, frequency, statistics | LRFS | LSSVM | 2 | valence: 70.9 arousal: 67.43 |
Fang et al. (2021) [39] | PSD, DE | MRMR | CF | 5 | angry, happy, sad, pleasant, and neutral: 51.3 |
Pandey and Seeja (2022) [12] | VMD-PSD, First Difference of IMF | DNN | 2 | valence: 62.5 arousal: 61.25 | |
EMD-PSD, First Difference of IMF | DNN | 2 | valence: 56 arousal: 60 | ||
Wang et al. (2022) [15] | Transformer | 2 | valence: 66.63 arousal: 66.2 |
||
Proposed method | VMD-PSD, PE, DE, Hjorth parameters | CRRAEN | WCF | 2 | valence: 80.94 arousal: 74.77 |
Method | Time(s) |
CF | 1897.95 |
WCF | 1698.97 |
RFVS-CF | 259.90 |
mRMR-CF | 208.65 |
EN-CF | 259.51 |
AEN-CF | 224.36 |
CRRAEN-CF | 496.52 |
CRRAEN-WCF | 474.68 |
Method | Accuracy | Precision | Recall |
CF | 0.6375 | 0.6412 | 0.7277 |
WCF | 0.6680 | 0.6667 | 0.7460 |
RFVS-CF | 0.6633 | 0.6715 | 0.7354 |
mRMR-CF | 0.6508 | 0.6530 | 0.7347 |
EN-CF | 0.6234 | 0.6335 | 0.7151 |
AEN-CF | 0.6398 | 0.6516 | 0.7201 |
CRRAEN-CF | 0.7047 | 0.7192 | 0.7571 |
CRRAEN-WCF | 0.7477 | 0.7614 | 0.7888 |
Method | Accuracy | Precision | Recall |
CF | 0.7477 | 0.7543 | 0.7795 |
WCF | 0.7688 | 0.7784 | 0.7956 |
RFVS-CF | 0.7648 | 0.7862 | 0.7879 |
mRMR-CF | 0.7469 | 0.7735 | 0.7702 |
EN-CF | 0.7375 | 0.7576 | 0.7650 |
AEN-CF | 0.7672 | 0.7847 | 0.7913 |
CRRAEN-CF | 0.7727 | 0.8325 | 0.7820 |
CRRAEN-WCF | 0.8094 | 0.8382 | 0.8250 |
Reference | Feature | Feature selection | Classifier | Number of classes | accuracy |
Koelstra et al. (2010) [32] | PSD | SVM | 3 | valence: 58.8 arousal: 55.7 | |
Wang and Shang (2013) [33] | DBN | 2 | valence: 51.2 arousal: 60.9 | ||
Jirayucharoensak et al. (2014) [34] | PS, difference between PS | SAE | 3 | valence: 49.52 arousal: 46.03 | |
PS, difference between PS | PCA | SAE | 3 | valence: 52.53 arousal: 49.17 | |
Ackermann et al. (2016) [35] | statistics, STFT, Higher Order Crossing, Hilbert-Huang Spectrum | MRMR | RF | 3 | anger, surprise and other: 55.23 |
Atkinson and Campos (2016) [36] | statistics, band power (BP), Hjorth parameters (HP), fractal dimension (FD) | MRMR | GA-SVM | 2 | valence: 73.14 arousal: 73.06 |
Li et al. (2017) [11] |
CWT, scale-map transform | CRNN | 2 | valence: 72.06 arousal: 74.12 |
|
Mert and Akan (2018) [37] |
MEMD-PSD, HP, entropy | ICA | KNN | 2 |
valence: 67 arousal: 51.01 |
Tiwari and Falk (2019) [3] | Benchmark Features, Motif-Based Features |
ANOVA | SVM | 2 | valence: 59.3 arousal: 54.46 |
Benchmark Features, Motif-Based Features |
MRMR | SVM | 2 | valence: 58.16 arousal: 56.45 | |
Benchmark Features, Motif-Based Features |
RFE | SVM | 2 | valence: 60.10 arousal: 55.98 |
|
Yin et al. (2020) [38] |
Temporal, frequency, statistics | LRFS | LSSVM | 2 | valence: 70.9 arousal: 67.43 |
Fang et al. (2021) [39] | PSD, DE | MRMR | CF | 5 | angry, happy, sad, pleasant, and neutral: 51.3 |
Pandey and Seeja (2022) [12] | VMD-PSD, First Difference of IMF | DNN | 2 | valence: 62.5 arousal: 61.25 | |
EMD-PSD, First Difference of IMF | DNN | 2 | valence: 56 arousal: 60 | ||
Wang et al. (2022) [15] | Transformer | 2 | valence: 66.63 arousal: 66.2 |
||
Proposed method | VMD-PSD, PE, DE, Hjorth parameters | CRRAEN | WCF | 2 | valence: 80.94 arousal: 74.77 |
Method | Time(s) |
CF | 1897.95 |
WCF | 1698.97 |
RFVS-CF | 259.90 |
mRMR-CF | 208.65 |
EN-CF | 259.51 |
AEN-CF | 224.36 |
CRRAEN-CF | 496.52 |
CRRAEN-WCF | 474.68 |