
Synthesizing hyperspectral images (HSI) from an ordinary camera has been accomplished recently. However, such computation models require detailed properties of the target camera, which can only be measured in a professional lab. This prerequisite prevents the synthesizing model from being installed on arbitrary cameras for end-users. This study offers a calibration-free method for transforming any camera into an HSI camera. Our solution requires no controllable light sources and spectrometers. Any consumer installing the program should produce high-quality HSI without the assistance of optical laboratories. Our approach facilitates a cycle-generative adversarial network (cycle-GAN) and sparse assimilation method to render the illumination-dependent spectral response function (SRF) of the underlying camera at the first part of the setup stage. The current illuminating function (CIF) must be identified for each image and decoupled from the underlying model. The HSI model is then integrated with the static SRF and dynamic CIF in the second part of the stage. The estimated SRFs and CIFs have been double-checked with the results by the standard laboratory method. The reconstructed HSIs have errors under 3% in the root mean square.
Citation: Yenming J. Chen, Jinn-Tsong Tsai, Kao-Shing Hwang, Chin-Lan Chen, Wen-Hsien Ho. Intelligent synthesis of hyperspectral images from arbitrary web cameras in latent sparse space reconstruction[J]. AIMS Mathematics, 2023, 8(11): 27989-28009. doi: 10.3934/math.20231432
[1] | María Ángeles García-Ferrero, Angkana Rüland . Strong unique continuation for the higher order fractional Laplacian. Mathematics in Engineering, 2019, 1(4): 715-774. doi: 10.3934/mine.2019.4.715 |
[2] | Lorenzo Brasco . Convex duality for principal frequencies. Mathematics in Engineering, 2022, 4(4): 1-28. doi: 10.3934/mine.2022032 |
[3] | Antonio Iannizzotto, Giovanni Porru . Optimization problems in rearrangement classes for fractional p-Laplacian equations. Mathematics in Engineering, 2025, 7(1): 13-34. doi: 10.3934/mine.2025002 |
[4] | Filippo Gazzola, Gianmarco Sperone . Remarks on radial symmetry and monotonicity for solutions of semilinear higher order elliptic equations. Mathematics in Engineering, 2022, 4(5): 1-24. doi: 10.3934/mine.2022040 |
[5] | Alessandra De Luca, Veronica Felli . Unique continuation from the edge of a crack. Mathematics in Engineering, 2021, 3(3): 1-40. doi: 10.3934/mine.2021023 |
[6] | Luigi Provenzano, Alessandro Savo . Isoparametric foliations and the Pompeiu property. Mathematics in Engineering, 2023, 5(2): 1-27. doi: 10.3934/mine.2023031 |
[7] | Giovanni S. Alberti, Yves Capdeboscq, Yannick Privat . On the randomised stability constant for inverse problems. Mathematics in Engineering, 2020, 2(2): 264-286. doi: 10.3934/mine.2020013 |
[8] | Alessandro Michelangeli, Raffaele Scandone . On real resonances for three-dimensional Schrödinger operators with point interactions. Mathematics in Engineering, 2021, 3(2): 1-14. doi: 10.3934/mine.2021017 |
[9] | Catharine W. K. Lo, José Francisco Rodrigues . On the obstacle problem in fractional generalised Orlicz spaces. Mathematics in Engineering, 2024, 6(5): 676-704. doi: 10.3934/mine.2024026 |
[10] | Peter Bella, Mathias Schäffner . Local boundedness for p-Laplacian with degenerate coefficients. Mathematics in Engineering, 2023, 5(5): 1-20. doi: 10.3934/mine.2023081 |
Synthesizing hyperspectral images (HSI) from an ordinary camera has been accomplished recently. However, such computation models require detailed properties of the target camera, which can only be measured in a professional lab. This prerequisite prevents the synthesizing model from being installed on arbitrary cameras for end-users. This study offers a calibration-free method for transforming any camera into an HSI camera. Our solution requires no controllable light sources and spectrometers. Any consumer installing the program should produce high-quality HSI without the assistance of optical laboratories. Our approach facilitates a cycle-generative adversarial network (cycle-GAN) and sparse assimilation method to render the illumination-dependent spectral response function (SRF) of the underlying camera at the first part of the setup stage. The current illuminating function (CIF) must be identified for each image and decoupled from the underlying model. The HSI model is then integrated with the static SRF and dynamic CIF in the second part of the stage. The estimated SRFs and CIFs have been double-checked with the results by the standard laboratory method. The reconstructed HSIs have errors under 3% in the root mean square.
Abbreviation: ACC: Accelerometer; ANN: Artificial neural network; BP: Blood pressure; BVP: Blood volume pulse; CNN: Convolutional neural network; DNN: Deep neural network; ECG: Electrocardiograph; EDA: Electrodermal activity; EEG: Electroencephalography; EMG: Electromyogram; GSR: Galvanic skin response; HR: Heart rate; HRV: Heart rate variability; IBI: Inter-beat interval; KNN: K-nearest neighbor: LASSO: Least absolute shrinkage selection operator; PCA: Principal component analysis; PPG: Photoplethysmography; RF: Random forest; RR: Respiration rate; RBF: Radial basis function kernel classifier; ST: Skin temperature; SVM: Support vector machine
Mental health problems are also known as mental health disorders, and they have become an important factor affecting individual growth and social stability [1]. According to the "Global Burden of Disease Study", the global prevalence of mental health disorders has always been above 10%, and about 450 million people have mental health disorders [2,3]. Long-term and persistent mental health problems increase the risk of cardiovascular disease, which is the leading cause of increased mortality worldwide [4]. Mental health disorders not only damage the physical and mental health of individuals, but they also increase the overall medical burden on society [5]. It is predicted that the total global medical expenditure for mental health problems will reach 16 trillion US dollars by 2030 [6]. Therefore, it is imperative to detect the mental health condition of individuals at an early stage and prevent the occurrence of mental health problems.
Smart sensors can accurately detect subtle changes (such as insomnia, headache, rapid heartbeat and muscle tension) in the body caused by stress in order to prevent and reduce the adverse effects of mental health problems [7,8]. With the increasing need for mental health care, the application of portable smart devices for monitoring mental health has received widespread attention and has been welcomed by most people [9]. At present, there are many studies on the use of wearable devices for mental health monitoring. Previous reviews have integrated the use of wearable sensors to detect stress levels, and there is a lack of exhaustive reviews on detection topics such as mental health problems or mental illness [8,10,11,12,13]. However, although some of the studies covered comprehensive information, the classification of detected signals were not described in detail in these reviews [14,15,16,17]. To fill this gap, this paper analyzes and integrates a large number of studies, covering the types of equipment used for data collection, common locations of sensors, common signals, advantages and disadvantages of technology, types of mental health states, etc. In this section, we discuss some existing studies. Abbreviations of nomenclatures used in this review paper are enumerated at the beginning of the article.
First, smart wearable devices have been widely used in the study of classifying and identifying individuals' daily stress and mental fatigue. Betti et al. [18] measured the electroencephalography (EEG), electrocardiography (ECG) and electrodermal activity (EDA) of healthy adults using wearable devices and a support vector machine (SVM) classification algorithm for statistical analysis; the results show that the accuracy of the classification algorithm based on salient features can reach 86% for detecting individual stress levels. Goumopoulos et al. [19] proposed a method to detect mental fatigue; they used a wearable device to collect the heart rate variability (HRV) of healthy subjects, and then selected an appropriate machine learning model to predict mental fatigue with high accuracy. Additionally, the role of wearable devices in the prediction and classification of mental health assessment and emotional states cannot be ignored. Sano et al. [20] explored the relationships between the academic performance, sleep quality, stress perception and mental health of college students using wearable devices and smartphones; they found that college students' academic performance, sleep quality and stress perception can significantly predict their mental health level, and that the accuracy rate of the algorithm is as high as 92%. Costa et al. [21] found that wearable devices can acquire and analyze social signals and identify the current negative emotions of individuals, and that, when it identifies a negative emotion, the device activates the emotional regulation system to help the individual stabilize their emotions. Overall, wearable devices have been widely used in the field of mental health with strong practicability. The main contributions of this paper are given as follows:
1) Recent artificial intelligence (AI)-based wearable devices methods and techniques are reviewed in this paper. This study comprehensively explores the application of AI-based wearable devices in the field of mental health, focusing on stress levels, mental health problems and mental illness.
2) This study covers comprehensive and detailed information, including types and application mechanisms for AI-based wearable devices, common locations of sensors, common signals, the advantages and disadvantages of technology and types of mental health states.
3) The articles included in this paper are from 2017 to 2022, and the articles are relatively new and cutting-edge.
The remainder of this paper is organized as follows. Section 2 introduces the review of body signals and mental health. Section 3 summarizes the methodology and Section 4 gives the literature survey. Finally, application, discussion and conclusion are given in Sections 5, 6 and 7.
The autonomic nervous system has two branches: the sympathetic nervous system (SNS) and parasympathetic nervous system (PNS). The SNS can rapidly stimulate some vital organs (such as the heart, lungs, and sweat glands), and the PNS can inhibit this excitation; they complement the regulation of important bodily functions and maintenance of physiological homeostasis [22]. When an individual is stressed, the individual will activate the protective mechanism; the SNS will be activated, the heart rate will increase, the breathing will increase, sweat will be secreted and other physiological systems will also be affected [23,24]. Additionally, the hypothalamus axis is a hormonal system that is initiated in response to stressors and drives a cascade of endocrine responses, culminating in the secretion of cortisol [22]. Therefore, individual health, especially mental health, can be known by observing the activity of the SNS and hypothalamus axis. Finally, language signals can also reflect an individual's mental health condition. Language is the basis for inferring individual thought processes; it provides a special perspective for thinking observations [25]. When an individual is in a state of tension, the respiratory rate increases and the glottis pressure increases, prompting an increase in the pitch of the vocalized part [25]. Therefore, clinicians mainly diagnose and treat mental illness through language and pronunciation [26]. As mentioned above, subtle changes in the body caused by stress can be detected by sensors connected to the body and hardware devices to accurately predict mental illness [27]. Therefore, AI-based wearable devices can collect one or more signals of the individual to identify and analyze the mental health level.
This review incorporated searches for relevant papers in the databases of Web of Science, Embase, MEDLINE and Engineering Village. The selected papers ranged from 2017 and 2022, which covers the recent developments in this area. The basic sets of keywords were "Psychological assessment", "psychological prediction", "psychological monitoring", "mental health", "psychological health", "mental illness", "psychological disorders" and "wearable device". A total of 1537 papers were retrieved by using all keywords, and 25 papers were finally included. The search methodology for this article was designed to provide a detailed review of published research papers on wearable sensor-based mental health detection. Figure 1 shows a flowchart describing the search strategy used for this review article.
1) The date of publication of the selected papers was between 2017 and 2022. 2) Conference papers and journal papers were included. 3) The research direction for the paper was mental health detection (focusing on stress, mental health problems, mental illness, etc.) and AI-based wearable devices for data collection.
1) Research published in a language other than English was excluded. 2) Studies structured as editorials, press releases, research highlights or letters, review articles, reviews and technical papers were excluded. 3) Duplicate papers, related review papers and papers not related to the topic were excluded.
We use smart/embedded wearable devices, such as smartwatches, that collect signals from the body [28]. This data can be processed by sensors on smart/embedded devices. WiFi is used to send the collected data to cloud servers [7]. In cloud computing, advanced AI models can be applied to mental health monitoring big data to generate simple knowledge through the use of expensive processing and advanced AI algorithms [29], such as Welch's algorithm [30], SVM algorithms [31] and period graphs [32], for the analysis of data. Using these AI algorithms, then by analyzing the information, we will be able to extract sensor data with the appropriate classification characteristics and then classify their mental health states [33]. Artificial intelligence algorithms such as information and SVMs are used again. The main function of an analog classifier is to classify different classifiers [34]. By ensuring the rapidity of calculation, feature patterns are transformed into related classes and the classes most suitable for a single feature are predicted. During the classification process, each feature will be analyzed to find the most suitable feature [35]. They will be analyzed separately to predict stresses. Once the classification is complete, the smart device displays their mental health rating on the screen. Therefore, people can improve their mental health through self-management (as shown in Figure 2). The virtual reality-augmented reality (VR-AR) based digital twin is the critical element for the intelligent development of AI-based wearable devices [36]. Finally, all of the elements are integrated into a smart device's ecology for the future direction framework.
Mental health detection systems based on wearable devices mainly consist of sensors, data acquisition, data pre-processing, feature extraction, machine learning algorithms and mental health condition classification. The general scheme of mental health prediction is shown in Figure 3. Section 4 will systematically introduce the current related research work according to the workflow of the wearable mental health detection system, as well as classify, compare and statistically analyze the existing research results.
Currently, the commonly used signals in mental health detection are HRV, EEG, galvanic skin response (GSR), skin temperature (ST), blood pressure (BP), sleep, salivary cortisol and blood oxygen saturation (SpO2). In most studies, researchers use multiple indicators to accurately reflect an individual's mental health. At the same time, the wearable device can also collect indicators such as steps, activity category, posture and activity trajectory. According to the needs of users and the functions achieved by the device, the device will be worn on different parts of the body. Common wearing positions are shown in Figure 4.
To solve the noise phenomenon in the data acquisition module, data pre-processing is required to reduce the impact of noise on the performance of mental health detection. Common filtering algorithms in mental health detection systems are wavelet transforms, the Kalman filter, etc. These filters can filter out high-frequency noise signals in the data [37,38].
The pre-processed data need to extract corresponding features for training the mental health detection model. Among them, the common feature extraction method is to use the sliding window algorithm to extract the features in the corresponding window, and then put the trained features into the trainer for training to form a mental health detection model. Table 1 summarizes the features extracted by the feature extraction module of an existing detection system. The commonly used feature values for feature extraction are Mean, Standard Deviation (StDev), Skew, Root Mean Square (RMS), Kurtosis, Quart1, Quart3, Inter-Quartile Range (IRQ), Sum, Frequency, etc.
References | Feature | Description |
[19,39,40,41,42,43,44,45,46,47,48,49,50,51] | Mean | Mean=∑is2i/n |
[19,39,40,41,42,43,44,45,46,47,49,50,51] | StDev | sd=√N∑i=1(xi−μ)2/(N−1) |
[40,45] | Skew | G1=n(n−1)(n−2)n∑i=1(xi−¯xs)3 |
[19,40,43] | RMS | rmΔ(s)=√∑is2j/x |
[39,40,45,51] | Kurtosis | K=μ/σ |
[39,40,45] | Quart1 | Median of lower 25% of the values |
[39,40,45] | Quart3 | Median of up 25% of the values |
[39,40,45,51] | IRQ | Difference between Quart1 and Quart3 |
[41,42,44] | Sum | S=N∑i=1wi |
[45] | Variance | sd2=1/nn∑i=1(xi−μ)2 |
[43] | Coefficient of variation | CV=sd/μ |
[45] | Inverse coefficient of variation | ICV=μ/sd |
[18,19,41,43,46,47,50,51,52] | Frequency | Peak frequency, low-frequency power (LF), high-frequency power (HF), very-low-frequency power (VLF), prevalent low frequency (pLF), prevalent high frequency (pHF), the ratio of LF to HF (LF/HF), (From Lomb–Scargle) LF, HF and LF/HF |
Machine learning technology refers to the method of using some special formulas to guide the computer to use the known data to build a model that can make predictions for new situations. Common machine learning algorithms for mental health detection include SVMs [18,19,39,43,44,48,51,52,53,54,55] k-nearest neighbor (KNN) algorithms [19,48,51,53], random forests (RF) [40,45,51], artificial neural networks (ANNs) [39], logistic regression (LR) [19,42,56,50,51], decision trees (DTs), Bayesian networks (BNs) [48], linear discriminant analysis (LDA) [51] and principal component analysis (PCA) [47,51]. The SVM is the most used algorithm for mental health detection among them. This paper summarizes the artificial intelligence algorithms involved in mental health detection and briefly describes their definitions, advantages and disadvantages, as shown in Table 2.
Machine learning algorithms | Definition | Advantages | Disadvantages |
Support Vector Machine (SVM) | In order to better find the hyperplane that can divide the feature space, the low-dimensional data are mapped to the high-dimensional plane | Works well with any complex problem, can scale high-dimensional data, risk of overfitting is lower; It shows high robustness and strong generation ability with low overfitting | Sensitive to missing data; It is difficult to implement large-scale training samples; There are difficulties in solving the multi-classification problem |
K-Nearest Neighbor (KNN) | In the feature space, if k samples near a sample belong to a category, that sample also belongs to that category | Simple thought, mature theory; Can be used for multiple classification, high accuracy, no assumptions on the data, not sensitive to outliers | Poor generalization ability and slow prediction speed |
Random Forests (RF) | A classifier for multiple decision trees | It can handle high-dimension data, has strong model generalization ability, and is fast in training. It can balance errors in unbalanced data sets, maintain accuracy if a large part of features is lost | In noisy classification or regression problems, overfitting and attribute weights produced based on data of different attribute values are not credible |
Artificial Neural Network (ANN) | A mathematical model for distributed parallel information processing that mimics the features of human neural network | It has a high accuracy of classification, strong parallel distribution processing ability, strong distribution storage and learning ability, adaptability, strong robustness and fault tolerance to noise nerve | Many parameters are required, which are not suitable for small samples, high calculation cost, long learning process, and intermediate results cannot be observed |
Logical Regression (LR) | The result of linear regression is mapped between 0 and 1 by a sigmoid function. The result of the mapping is the probability that the data sample point belongs to a certain category | The computational cost is low, the implementation is easy to understand, the robustness to small noise in the data is good, and it is not particularly affected by slight multicollinearity | It is easy to underfit, and the performance effect is not good when the classification accuracy is not high, the data features are missing or the feature space is large |
Decision Tree (DT) | A prediction model is a mapping between object attributes and object values | Easy to understand and explain, visualization of tree structure, less data required for training, fast analysis speed, and strong ability to process samples with missing attributes | Overfitting, poor generalization ability and unstable calculation results |
Bayesian Network (BN) | Graphical networks based on probabilistic inference | The process is simple and fast, efficient for multi-classification problems, and requires a smaller sample size | Sample attributes do not work well when they are associated and are sensitive to input forms |
Linear Discriminant Analysis (LDA) | The labeled data is projected to a lower dimensional space so that the points after projection are distinguished by category. Points of the same category are closer to each other in the space after projection, and the differences are as far as possible | It can reflect the differences between samples and use the prior knowledge experience of categories in dimension reduction | Easy to fit, restricted by sample type |
Principal Component Analysis (PCA) | A method of multivariate statistical analysis in which several variables are transformed linearly to select a smaller number of important variables | Eliminate the mutual influence between indicators, small workload | The amount of information must be kept at a high level after dimensionality reduction, and the interpretation of principal components is usually fuzzy |
Mental health problems are often accompanied by obvious body responses, and wearable sensors play an important role in mental health and its related applications. They can collect an individual's EEG, heart rate (HR), HRV, GSR, BP, body temperature, respiratory rate, etc., to monitor mental health. This section summarizes and discusses the signals commonly used in mental health detection, and briefly summarizes the relevant research on AI-based wearable devices in mental health detection over the past five years. The main contents include mental health disease types, parameters, results, etc., as shown in Table 3.
Study | Type | Device | Signals | Model | Result |
Can et al., 2019 [51] | Stress | Samsung Gear S, S2 and Empatica E4 wristband | End Diastolic Velocity, HRV and ACC | PCA, LDA, SVM, KNN, RF and LR | The accuracy of the multi-layer perceptron algorithm is 92.19%. |
Betti et al., 2018 [18] | Stress | Mobile EEG headset, Chest belt | HRV, EDA, EEG and Salivary cortisol | SVM | The classification algorithm based on salient features has an accuracy of 86%. |
Egilmez et al., 2017 [40] | Stress | LG smartwatch | GSR | RF | The pressure detection F-value of the system is 88.8%. |
Gjoreski et al., 2019 [39] | Stress | Empatica device's wristband | HR, BVP, IBI, ST and ACC | SVM | Experiments with 55 days of real data showed that the method detected stress events with 95% accuracy. |
Ahn et al., 2019 [52] | Stress | Head-mounted electrode | ECG, EEG | SVM | The accuracy of the classification of pressure states using SVM technology and the 5-fold cross validation of EEG and HRV features was 87.5%. |
Wu et al., 2019 [50] | Stress | Smart shirt | Cortisol, HRV | LR | Mental stress index is sensitive to acute stress and can predict the level of association between normal individuals and the stress group with about 97% accuracy. |
Jesmin et al., 2020 [53] | Stress | Smartwatch | HRV, GSR and SpO2 | SVM, KNN | Multi-sensor data fusion can better monitor individual stress levels. |
Bobade et al., 2020 [41] |
Stress | Chest-worn device, RespiBAN Professional and a wrist-worn device Empatica E4 | ACC, ECG, BVP, ST, RR, EMG and EDA | Machine Learning |
Using machine learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 81.65 and 93.20%, respectively, and using deep learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 84.32 and 95.21%, respectively. |
Silva et al., 2020 [49] | Stress | Smartwatch | HRV | ANN | Several parameters of the HRV under pressure were significantly different from baseline. |
Kim et al., 2020 [42] | Stress | Empatica E4 wristband | GSR | LR | The overall classification accuracy of the model was 85.3%, and the cross-validation accuracy was 83.2%. |
Han et al., 2020 [48] | Stress | Smart wrist band, Empatica E4 wristband and a Shimmer ECG device | ECG, PPG and GSR | KNN, SVM, and BN | The accuracy of 10-fold cross-validation was 94.55%, while the accuracy of subject cross-validation was 85.71%. In daily settings, the system had 81.82% accuracy in assessing stress. |
Tonacci et al., 2019 [62] | Stress | Shimmer™, Shimmer3™ | GSR, ECG | Pan Tompkins algorithm | HR increased with both sensory (olfactory) and cognitive stimulation, and there was a similar decrease in HRV. |
Tiwari et al., 2019 [43] | Stress and anxiety |
OMsignal smart-shirt and a Fitbit Charge 2 smart-bracelet | HRV | SVM | Overall, HRV characteristics calculated from wearable vest data (with higher temporal resolution) proved to be more effective in predicting stress and anxiety. |
Patlar Akbulut et al., 2020 [47] | Stress and anxiety |
Wrist-worn wearable device | ECG, GSR, ST, SpO2, Blood sugar and BP | PCA, Feed-forward neural network | Method had an average accuracy of 92 and 89% in distinguishing the stress levels of the experimental group and the control group, respectively. |
Sano et al., 2018 [54] | Stress and mental health condition | Q-sensor and Motion logger | SC, ST, ACC and Motion logger | SVM, LASSO and RBF | The sensors were 78.3% accurate in classifying students as high or low stress groups, and 87% accurate in classifying students as high or low mental health groups. |
Fraiwan et al., 2018 [65] | Mental health condition | Smart glove | ST and HRV | HRV algorithm | There was significant variability in skin resistance and HRV under relaxation and stimulation. |
Chen et al., 2018 [68] | Mental health condition | Smartwatch | Speech | Transfer learning | The use of wearable long-term speech-to-social data sets to categorize various sound scenes in unconstrained and natural environments yielded good results. |
Wen et al., 2020 [44] | Anxiety | The MP150 multi-channel physiological recorder | HR (ECG electrodes) | SVM | The classifier achieved 81.82% accuracy in detecting a high anxiety state. |
O'Brien et al., 2017[56] | Depression |
Wrist-mounted device |
ACC | LR | General physical activity was significantly reduced in depressed older adults compared with healthy controls. |
Narziev et al., 2020 [67] | Depression |
Smartwatch | Motion logger and HR | Machine learning |
Participants' ecological transient assessment self-report and passive perception (sensor data) were highly correlated between physical activity, mood and sleep levels; Depression detector verified the feasibility of grouping and classification with an accuracy rate of 96.00% (StDev = 2.76). |
Zanella-Calzada et al., 2019 [45] | Depression |
Actiwatch | Motion Logger | RF | Patients with depression were accurately detected in 86.7% of the cases, while those without depression were found in 91.9% of the cases. |
Gentili et al., 2017 [55] | Bipolar disorder | Sensorized t-shirt | HRV, respiration and ACC | SVM | After comparing the HRV features of the emotional states before and after, the classification accuracy of the SVM in the target state was significantly improved. |
Zheng et al., 2019 [46] | Emotion recognition |
Head/wearable eye-tracking glasses, EmotionMeter hardware | EEG and Eye-movement signal | Multi-mode deep neural network model | The modal fusion performance of the multi-mode deep neural network was significantly improved, and the average accuracy of the four emotions (happiness, sadness, fear and neutral) reached 85.11%. |
Goumopoulos et al., 2022 [19] | Mental fatigue | Zephyr HXM-BT, Chest-strap built-in ECG sensor | HRV | SVM, KNN and LR | Mental fatigue caused by cognitive load can be detected by wearable commercial devices and a single biomarker. |
Qiu et al., 2020 [38] | Mental fatigue | Head-mounted devices | ECG, EEG and ST | Wavelet transform, Kalman filtering | Through the fatigue evaluation experiment of 10 healthy subjects, it was verified that the device can reliably detect the mental fatigue state by monitoring and analyzing EEG, ECG and proximal ST data. |
EEG is a detection method that records the electrical activity of the cerebral cortex by connecting multiple electrodes to the scalp [57] to reflect the emotional changes of individuals to specific stimuli. The stress state of the human body is mainly identified by the changes in the alpha and delta bands of the EEG. When people are under high levels of stress, alpha decreases and delta increases in the EEG, whereas, when they are relaxed or inactive, alpha increases and delta decreases [58]. Ahn et al. [52] used a binaural hanging device that combined EEG and ECG signals and found that the accuracy of EEG and ECG signals in continuous stress assessment was 87.5%. Additionally, based on the multi-modal emotion recognition framework of EmotionMeter, some scholars found that the individual's happiness, sadness, fear and neutral emotions can be distinguished and identified by EEG signals [46]. Finally, EEG combined with ECG signals, body temperature and other data can reliably detect the mental fatigue state of an individual, which provides support for the popularization and specific implementation of mental health assessments [38]. Table 3 shows the studies and their details on the use of EEG for mental health detection. EEG is one of the most effective physiological signals for detecting psychological stress, and the combination of ECG signals and machine learning applications can improve the accuracy of psychological stress detection.
Stressors induce changes in myocardial activity, promote blood distribution to vital organs and cause changes in the HR and HRV [59]. ECG is a reliable technique that uses electrodes attached to the body to record the electrical potential produced by heart muscle cells to observe changes in HRV [59]. Studies have shown that HR and HRV signals are closely related to mental health problems (such as anxiety and depression) [60], and they have an important role in mental health detection. Tiwari et al. [43] found that HRV data obtained through smart t-shirts and smart bracelets can more accurately assess the degree of anxiety and depression of individuals. Second, HRV characteristics are closely associated with an individual's clinical course and mood changes. Gentili et al. [55] showed that the HRV characteristics (including time domain, frequency domain, nonlinearity) of patients with bipolar disorder can significantly predict their clinical course and emotional changes. Additionally, Akbulut et al. [47] believe that the stress state of an individual can be accurately identified by signals such as the HRV obtained by wearable devices. Table 3 presents an overview and the details of mental health testing using ECG.
Sweat is a biological fluid produced by sweat glands. Sweat glands are distributed almost all over the body, with high-density distribution in the palms, soles and forehead; they are only stimulated by the sympathetic nerves, which are activated under tension [14], so the level of sweat secretion is the body's response to stress. Since sweat contains ions, small molecules and even macromolecular substances such as proteins, it has electrical conductivity [61]. Therefore, the EDA signal is obtained by connecting two electrodes on the skin surface, and the voltage between them is measured to monitor the stress. Sriamprakash et al. [33] collected the GSR, HRV and other indicators, extracted individual stress characteristics by using Welch's algorithm and periodograms, etc., and used SVM and KNN algorithms to classify levels of mental health. Tonacci et al. [62] captured the ECG and skin conductance (SC) signals of subjects by using wearable sensors and compared the effects of cognitive stress and olfactory stimuli on human mental health. Additionally, EDA and GSR also change with different emotions; this can effectively reflect the state of emotional calm or emotional arousal [63]. Evidence shows that the emotional state of patients with metabolic syndrome can be analyzed by using indicators such as the GSR, HRV and body temperature detected by wearable devices [47]. The above studies have confirmed that combining GSR with HRV, body temperature and other parameters can improve the prediction rate for individual stress, and accurately provide and detect information about individual emotional states.
High levels of stress can cause muscle tension, increase the HR and increase the work of the vital organs; additionally, the blood will be shunted from the limbs to directly act on the vital organs. In just a few minutes, the body surface temperature can change [64]. Most of the studies involved the collection of body temperature data to detect mental health conditions. Fraiwan et al. [65] designed a mobile mental health monitoring system for continuous monitoring of changes in ST, skin electrical impedance and HRV; they found that, when the body was exposed to relaxation or stimulation, the skin electrical resistance value and HRV exhibited significant variability. Engelniederhammer et al. [66] monitored the EDA and ST in subjects wearing a smart wristband (Bodymonitor); the results showed that being in a crowded environment triggers aversive emotional responses (i.e., stress, anger and fear) in subjects. Sano et al. [54] had 201 college students wear two sensors on each wrist: a Q-sensor (for measuring SC, ST, triaxial acceleration (ACC)) and motion recorder (for measuring acceleration and ambient light data); the study showed that wearable sensor eigenvalues are more accurate than cell phones for classifying poor mental health (87%) and stress (78.3%) when using machine learning techniques. In conclusion, the ST, combined with parameters such as EDA, is more accurate for detecting emotional changes and stress levels.
With the development of artificial intelligence, the natural language processing program of the computer can more accurately identify the subtle barriers in language before the onset of mental illness, achieving the effect of accurately predicting the mental illness [27]. Corcoran et al. [27] confirmed that the automatic learning of speech classifiers with reduced semantic consistency, increased semantic consistency differences and reduced use of case pronouns can predict psychotic episodes with an accuracy rate of 83%. Language signals help to improve the accuracy of mental disease prediction and have become a powerful tool for psychiatric diagnosis and treatment.
In addition, mental illness is associated with changes in an individual's behavior, especially physical movement and social habits. AI-based wearable devices (accelerometers, global position system sensors) can detect these abnormal behaviors and effectively reduce the incidence of individual mental illness. Zanella-Calzada et al. [45] obtained data on motor activity recorded by wearable devices from a depression database, extracted the statistical features and then fed them to an RF classifier to detect subjects with depression; the results showed that those subjects with the presence of depression had an 86.7% likelihood of being correctly classified, while the specificity showed that those subjects with the absence of depression had a 91.9% likelihood of being classified with a correct response. Additionally, Narziev et al. [67] recruited 20 participants from four different levels of depression groups and established a machine learning model to automatically classify the categories of depression in a short period of time; the results of this pilot study revealed high correlations between participants' ecological momentary assessment self-reports and passive sensing (sensor data) in terms of physical activity, mood and sleep levels. All of the above studies have shown that the role of speech signals and behavioral signals in the detection of mental health states such as depression, mood disorders or schizophrenia cannot be ignored.
Multiple studies have integrated the use of wearable sensors to detect stress levels, and there is a lack of exhaustive reviews on topics such as mental health or illness detection. However, some reviews comprehensively cover information, but do not provide details such as machine learning methods, types of mental health states and the classification of detection signals. To fill this gap, this paper presents an extensive review of mental health detection using wearable sensors.
The significant observations from this review are listed below.
● Most wearable sensors are placed on the wrist (such as smartwatches and smart wristbands), chest (smart t-shirts, chest straps, etc.) and head (head-mounted devices, etc.) for data collection.
● The Empatica, EmotionMeter and SHIMMER platforms have been popularly used for data collection.
● HRV, EEG, GSR and ST are the most distinctive signals for detecting mental health conditions.
● SVM, LR and KNN have been widely used in the classification of mental health conditions.
● Most of the disease types studied focused on psychological stress levels and mental health problems (anxiety and depression).
The identified challenges are described below.
● Compliance: The collection of subjects' physiological indicators and daily activity data is affected by individual living habits, which makes the detection process more complicated and thus degrades the performance of the mental health detection system. Therefore, the mental health assessment based on wearable devices can only do a rough screening and cannot directly assess the disease accurately.
● Singleness of indicators: most studies involve the collection of only a single category of signals, such as physiology, speech and behavior, while fewer collect a combination. Since different categories of data and signals reflect different aspects of mental health, the integrated collection of all types of signals is essential for the accurate screening of mental health problems.
Globally, mental health problems are still serious social public health problems, and mental health has always been detected via subjective and time-consuming methods. Wearable sensors have great potential as a continuous monitoring system for mental health. As the needs for mental health care services increase, so does the production of AI-based wearable devices. This paper reviewed the types of equipment, common locations of sensors, common signals, advantages and disadvantages of technology and types of mental health states in previous studies. First of all, the most common types of states in mental health testing are psychological stress and mental health problems (anxiety and depression), and most wearable sensors are placed on the wrist, chest or head. Additionally, it was found that there are many AI-based wearable devices on the market that mainly collect physiological signals for mental health detection, and that there are few studies on detecting mental health with speech signals and behavioral signals. Among the physiological signals, the HRV, EEG, ST and GSR signals are the most effective parameter indicators for detecting the mental health condition and this conclusion is helpful for researchers to create devices to improve people's mental health level. Finally, this paper integrated important information, such as the technical advantages and limitations of previous studies; it was found that the most common classifiers are SVM, KNN and LR algorithms, and that these algorithms provide the most reliable results for mental health detection. In the future, with the continuous innovation of science and technology, deterministic annealing neural networks [69,70], convolutional neural networks [71,72,73], the logarithmic descent direction algorithm [74], Laplacian-Hessian regularization semi-supervised extreme learning machine [75,76] and back-propagation neural networks [77] are also expected to be applied and explored in the field of mental health detection. In a word, the goal of mental health testing is to develop an efficient and affordable model with high accuracy. This review will help future researchers design optimal wearable devices to achieve their goals for mental health detection.
This work is supported by the Changsha Natural Science Foundation No. kq2202422 and the Chinese Scholarship Council (CSC) under grant No. 202006370101.
All authors declare no conflicts of interest in this paper.
[1] | B. Arad, O. Ben-Shahar, Sparse recovery of hyperspectral signal from natural RGB images, in European Conference on Computer Vision, Springer, 19–34. (2016), https://doi.org/10.1007/978-3-319-46478-7_2 |
[2] |
I. Choi, D. S. Jeon, G. Nam, D. Gutierrez, M. H. Kim, High-quality hyperspectral reconstruction using a spectral prior, ACM T. Graphic., 36 (2017), 1–13. http://dx.doi.org/10.1145/3130800.3130810 doi: 10.1145/3130800.3130810
![]() |
[3] |
W. Jakob, J. Hanika, A low-dimensional function space for efficient spectral upsampling, Comput. Graph. Forum, 38 (2019), 147–155. https://doi.org/10.1111/cgf.13626 doi: 10.1111/cgf.13626
![]() |
[4] | Y. Jia, Y. Zheng, L. Gu, A. Subpa-Asa, A. Lam, Y. Sato, et al., From RGB to spectrum for natural scenes via manifold-based mapping, in Proceedings of the IEEE international conference on computer vision, 4705–4713, (2017). https://doi.org/10.1109/ICCV.2017.504 |
[5] | H. Kwon, Y. W. Tai, RGB-guided hyperspectral image upsampling, in Proceedings of the IEEE International Conference on Computer Vision, 307–315, (2015). https://doi.org/10.1109/ICCV.2015.43 |
[6] | S. W. Oh, M. S. Brown, M. Pollefeys, S. J. Kim, Do it yourself hyperspectral imaging with everyday digital cameras, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2461–2469, (2016). https://doi.org/10.1109/CVPR.2016.270 |
[7] |
Q. Li, X. He, Y. Wang, H. Liu, D. Xu, F. Guo, Review of spectral imaging technology in biomedical engineering: achievements and challenges, J. Biomed. Opt., 18 (2013), 100901–100901. https://doi.org/10.1117/1.JBO.18.10.100901 doi: 10.1117/1.JBO.18.10.100901
![]() |
[8] |
M. Aharon, M. Elad, A. Bruckstein, K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation, IEEE T. Signal Proces., 54 (2006), 4311–4322. https://doi.org/10.1109/TSP.2006.881199 doi: 10.1109/TSP.2006.881199
![]() |
[9] |
Z. Xing, M. Zhou, A. Castrodad, G. Sapiro, L. Carin, Dictionary learning for noisy and incomplete hyperspectral images, SIAM J. Imag. Sci., 5 (2012), 33–56. https://doi.org/10.1137/110837486 doi: 10.1137/110837486
![]() |
[10] |
O. Burggraaff, N. Schmidt, J. Zamorano, K. Pauly, S. Pascual, C. Tapia, et al., Standardized spectral and radiometric calibration of consumer cameras, Optics Express, 27 (2019), 19075–19101. https://doi.org/10.1364/OE.27.019075 doi: 10.1364/OE.27.019075
![]() |
[11] | J. Jiang, D. Liu, J. Gu, S. Süsstrunk, What is the space of spectral sensitivity functions for digital color cameras?, in 2013 IEEE Workshop on Applications of Computer Vision (WACV), IEEE, 168–179, (2013). https://doi.org/10.1109/WACV.2013.6475015 |
[12] | S. Han, Y. Matsushita, I. Sato, T. Okabe, Y. Sato, Camera spectral sensitivity estimation from a single image under unknown illumination by using fluorescence, in 2012 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, 805–812, (2012). https://doi.org/10.1109/CVPR.2012.6247752 |
[13] |
G. Wu, L. Qian, G. Hu, X. Li, Spectral reflectance recovery from tristimulus values under multi-illuminants, J. Spectrosc., 2019. https://doi.org/10.1155/2019/3538265 doi: 10.1155/2019/3538265
![]() |
[14] |
L. Yan, X. Wang, M. Zhao, M. Kaloorazi, J. Chen, S. Rahardja, Reconstruction of hyperspectral data from RGB images with prior category information, IEEE T. Comput. Imag., 6 (2020), 1070–1081. https://doi.org/10.1109/TCI.2020.3000320 doi: 10.1109/TCI.2020.3000320
![]() |
[15] |
M. D. Grossberg, S. K. Nayar, Determining the camera response from images: What is knowable?, IEEE T. Pattern Anal., 25 (2003), 1455–1467. https://doi.org/10.1109/TPAMI.2003.1240119 doi: 10.1109/TPAMI.2003.1240119
![]() |
[16] | Y. Choi, M. Choi, M. Kim, J. W. Ha, S. Kim, J. Choo, StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation, in Proceedings of the IEEE conference on computer vision and pattern recognition, 8789–8797, (2018). https://doi.org/10.48550/arXiv.1711.09020 |
[17] | J. Schneider, Domain transformer: Predicting samples of unseen, future domains, in 2022 International Joint Conference on Neural Networks (IJCNN), IEEE, 1–8, (2022). https://doi.org/10.1109/IJCNN55064.2022.9892250. |
[18] |
L. Yan, J. Feng, T. Hang, Y. Zhu, Flow interval prediction based on deep residual network and lower and upper boundary estimation method, Appl. Soft Comput., 104 (2021), 107228. https://doi.org/10.1016/j.asoc.2021.107228 doi: 10.1016/j.asoc.2021.107228
![]() |
[19] | P. Isola, J. Y. Zhu, T. Zhou, A. A. Efros, Image-to-image translation with conditional adversarial networks, in Proceedings of the IEEE conference on computer vision and pattern recognition, 1125–1134, (2017). https://doi.org/10.1109/CVPR.2017.632 |
[20] |
Y. J. Chen, L. C. Lin, S. T. Yang, K. S. Hwang, C. T. Liao, W. H. Ho, High-reliability non-contact photoplethysmography imaging for newborn care by a generative artificial intelligence, IEEE Access, 11 (2022), 90801–90810. https://doi.org/10.1109/ACCESS.2023.3307637 doi: 10.1109/ACCESS.2023.3307637
![]() |
[21] |
K. Yin, Z. Chen, H. Huang, D. Cohen-Or, H. Zhang, Logan: Unpaired shape transform in latent overcomplete space, ACM T. Graphic., 38 (2019), 1–13. https://doi.org/10.1145/3355089.3356494 doi: 10.1145/3355089.3356494
![]() |
[22] |
H. You, Y. Cheng, T. Cheng, C. Li, P. Zhou, Bayesian cycle-consistent generative adversarial networks via marginalizing latent sampling, IEEE T. Neur. Net. Learn. Syst., 32 (2020), 4389–4403. https://doi.org/10.1109/TNNLS.2020.3017669 doi: 10.1109/TNNLS.2020.3017669
![]() |
[23] | F. Campillo, V. Rossi, Convolution particle filter for parameter estimation in general state-space models, IEEE T. Aero. Elec. Syst., 45. https://doi.org/10.1109/TAES.2009.5259183 |
[24] | K. Vo, E. K. Naeini, A. Naderi, D. Jilani, A. M. Rahmani, N. Dutt, et al., P2e-wgan: Ecg waveform synthesis from PPG with conditional wasserstein generative adversarial networks, in Proceedings of the 36th Annual ACM Symposium on Applied Computing, 1030–1036, (2021). https://doi.org/10.1145/3412841.3441979 |
[25] |
G. Tsialiamanis, M. Champneys, N. Dervilis, D. J. Wagg, K. Worden, On the application of generative adversarial networks for nonlinear modal analysis, Mech. Syst. Signal Pr., 166 (2022), 108473. https://doi.org/10.1016/j.ymssp.2021.108473 doi: 10.1016/j.ymssp.2021.108473
![]() |
[26] |
S. A. Burns, Chromatic adaptation transform by spectral reconstruction, Color Res. Appl., 44 (2019), 682–693. https://doi.org/10.1002/col.22384 doi: 10.1002/col.22384
![]() |
[27] |
M. Störring, H. J. Andersen, E. Granum, Physics-based modelling of human skin colour under mixed illuminants, Robot. Auton. Syst., 35 (2001), 131–142. https://doi.org/10.1016/S0921-8890(01)00122-1 doi: 10.1016/S0921-8890(01)00122-1
![]() |
[28] |
X. Zhang, Q. Wang, J. Li, X. Zhou, Y. Yang, H. Xu, Estimating spectral reflectance from camera responses based on cie xyz tristimulus values under multi-illuminants, Color Res. Appl., 42 (2017), 68–77. https://doi.org/10.1002/col.22037 doi: 10.1002/col.22037
![]() |
[29] |
J. F. Galantowicz, D. Entekhabi, E. G. Njoku, Tests of sequential data assimilation for retrieving profile soil moisture and temperature from observed L-band radiobrightness, IEEE T. Geosci. Remote, 37 (1999), 1860–1870. https://doi.org/10.1109/36.774699 doi: 10.1109/36.774699
![]() |
[30] |
J. S. Liu, F. Liang, W. H. Wong, The multiple-try method and local optimization in metropolis sampling, J. Am. Stat. Assoc., 95 (2000), 121–134. https://doi.org/10.1080/01621459.2000.10473908 doi: 10.1080/01621459.2000.10473908
![]() |
[31] |
L. Martino, J. Read, D. Luengo, Independent doubly adaptive rejection metropolis sampling within gibbs sampling., IEEE T. Signal Proces., 63 (2015), 3123–3138. https://doi.org/10.1109/TSP.2015.2420537 doi: 10.1109/TSP.2015.2420537
![]() |
[32] | I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, et al., Generative adversarial nets, in Advances in neural information processing systems, 2672–2680, (2014). https://doi.org/10.48550/arXiv.1406.2661 |
[33] |
P. Vincent, Y. Bengio, Kernel matching pursuit, Mach. Learn., 48 (2002), 165–187. https://doi.org/10.1023/A:1013955821559 doi: 10.1023/A:1013955821559
![]() |
[34] |
G. Aneiros-Pérez, R. Cao, J. M. Vilar-Fernández, Functional methods for time series prediction: A nonparametric approach, J. Forecasting, 30 (2011), 377–392. https://doi.org/10.1002/for.1169 doi: 10.1002/for.1169
![]() |
[35] |
E. Masry, Nonparametric regression estimation for dependent functional data: asymptotic normality, Stoch. Proc. Appl., 115 (2005), 155–177. https://doi.org/10.1016/j.spa.2004.07.006 doi: 10.1016/j.spa.2004.07.006
![]() |
[36] |
H. Chun, S. Keleş, Sparse partial least squares regression for simultaneous dimension reduction and variable selection, J. Roy. Stat. Soc. B, 72 (2010), 3–25. https://doi.org/10.1111/j.1467-9868.2009.00723.x doi: 10.1111/j.1467-9868.2009.00723.x
![]() |
[37] |
G. Zhu, Z. Su, Envelope-based sparse partial least squares, Ann. Stat., 48 (2020), 161–182. https://doi.org/10.1214/18-AOS1796 doi: 10.1214/18-AOS1796
![]() |
[38] |
H. Zou, T. Hastie, Regularization and variable selection via the elastic net, J. Roy. Stat. Soc. B, 67 (2005), 301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x doi: 10.1111/j.1467-9868.2005.00503.x
![]() |
[39] |
E. J. Candes, T. Tao, Decoding by linear programming, IEEE T. Inf. Theory, 51 (2005), 4203–4215. https://doi.org/10.1109/TIT.2005.858979 doi: 10.1109/TIT.2005.858979
![]() |
[40] |
E. J. Candès, J. Romberg, T. Tao, Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information, IEEE T. Inf. Theory, 52 (2006), 489–509. https://doi.org/10.1109/TIT.2005.862083 doi: 10.1109/TIT.2005.862083
![]() |
[41] |
D. W. Marquardt, R. D. Snee, Ridge regression in practice, Am. Stat., 29 (1975), 3–20. https://doi.org/10.1080/00031305.1975.10479105 doi: 10.1080/00031305.1975.10479105
![]() |
[42] |
P. Exterkate, P. J. Groenen, C. Heij, D. van Dijk, Nonlinear forecasting with many predictors using kernel ridge regression, Int. J. Forecasting, 32 (2016), 736–753. https://doi.org/10.1016/j.ijforecast.2015.11.017 doi: 10.1016/j.ijforecast.2015.11.017
![]() |
[43] |
C. García, J. García, M. López Martín, R. Salmerón, Collinearity: Revisiting the variance inflation factor in ridge regression, J. Appl. Stat., 42 (2015), 648–661. https://doi.org/10.1080/02664763.2014.980789 doi: 10.1080/02664763.2014.980789
![]() |
[44] |
E. J. Candes, The restricted isometry property and its implications for compressed sensing, CR Math., 346 (2008), 589–592. https://doi.org/10.1016/j.crma.2008.03.014 doi: 10.1016/j.crma.2008.03.014
![]() |
[45] | M. Uzair, Z. Khan, A. Mahmood, F. Shafait, A. Mian, Uwa hyperspectral face database, 2023, https://dx.doi.org/10.21227/8714-kx37 |
1. | Alberto Saldaña, 2021, 775, 978-1-4704-6728-9, 255, 10.1090/conm/775/15595 | |
2. | Serena Dipierro, Luca Lombardini, Partial differential equations from theory to applications: Dedicated to Alberto Farina, on the occasion of his 50th birthday, 2023, 5, 2640-3501, 1, 10.3934/mine.2023050 | |
3. | Serena Dipierro, Giorgio Poggesi, Jack Thompson, Enrico Valdinoci, Quantitative stability for overdetermined nonlocal problems with parallel surfaces and investigation of the stability exponents, 2024, 188, 00217824, 273, 10.1016/j.matpur.2024.06.011 |
References | Feature | Description |
[19,39,40,41,42,43,44,45,46,47,48,49,50,51] | Mean | Mean=∑is2i/n |
[19,39,40,41,42,43,44,45,46,47,49,50,51] | StDev | sd=√N∑i=1(xi−μ)2/(N−1) |
[40,45] | Skew | G1=n(n−1)(n−2)n∑i=1(xi−¯xs)3 |
[19,40,43] | RMS | rmΔ(s)=√∑is2j/x |
[39,40,45,51] | Kurtosis | K=μ/σ |
[39,40,45] | Quart1 | Median of lower 25% of the values |
[39,40,45] | Quart3 | Median of up 25% of the values |
[39,40,45,51] | IRQ | Difference between Quart1 and Quart3 |
[41,42,44] | Sum | S=N∑i=1wi |
[45] | Variance | sd2=1/nn∑i=1(xi−μ)2 |
[43] | Coefficient of variation | CV=sd/μ |
[45] | Inverse coefficient of variation | ICV=μ/sd |
[18,19,41,43,46,47,50,51,52] | Frequency | Peak frequency, low-frequency power (LF), high-frequency power (HF), very-low-frequency power (VLF), prevalent low frequency (pLF), prevalent high frequency (pHF), the ratio of LF to HF (LF/HF), (From Lomb–Scargle) LF, HF and LF/HF |
Machine learning algorithms | Definition | Advantages | Disadvantages |
Support Vector Machine (SVM) | In order to better find the hyperplane that can divide the feature space, the low-dimensional data are mapped to the high-dimensional plane | Works well with any complex problem, can scale high-dimensional data, risk of overfitting is lower; It shows high robustness and strong generation ability with low overfitting | Sensitive to missing data; It is difficult to implement large-scale training samples; There are difficulties in solving the multi-classification problem |
K-Nearest Neighbor (KNN) | In the feature space, if k samples near a sample belong to a category, that sample also belongs to that category | Simple thought, mature theory; Can be used for multiple classification, high accuracy, no assumptions on the data, not sensitive to outliers | Poor generalization ability and slow prediction speed |
Random Forests (RF) | A classifier for multiple decision trees | It can handle high-dimension data, has strong model generalization ability, and is fast in training. It can balance errors in unbalanced data sets, maintain accuracy if a large part of features is lost | In noisy classification or regression problems, overfitting and attribute weights produced based on data of different attribute values are not credible |
Artificial Neural Network (ANN) | A mathematical model for distributed parallel information processing that mimics the features of human neural network | It has a high accuracy of classification, strong parallel distribution processing ability, strong distribution storage and learning ability, adaptability, strong robustness and fault tolerance to noise nerve | Many parameters are required, which are not suitable for small samples, high calculation cost, long learning process, and intermediate results cannot be observed |
Logical Regression (LR) | The result of linear regression is mapped between 0 and 1 by a sigmoid function. The result of the mapping is the probability that the data sample point belongs to a certain category | The computational cost is low, the implementation is easy to understand, the robustness to small noise in the data is good, and it is not particularly affected by slight multicollinearity | It is easy to underfit, and the performance effect is not good when the classification accuracy is not high, the data features are missing or the feature space is large |
Decision Tree (DT) | A prediction model is a mapping between object attributes and object values | Easy to understand and explain, visualization of tree structure, less data required for training, fast analysis speed, and strong ability to process samples with missing attributes | Overfitting, poor generalization ability and unstable calculation results |
Bayesian Network (BN) | Graphical networks based on probabilistic inference | The process is simple and fast, efficient for multi-classification problems, and requires a smaller sample size | Sample attributes do not work well when they are associated and are sensitive to input forms |
Linear Discriminant Analysis (LDA) | The labeled data is projected to a lower dimensional space so that the points after projection are distinguished by category. Points of the same category are closer to each other in the space after projection, and the differences are as far as possible | It can reflect the differences between samples and use the prior knowledge experience of categories in dimension reduction | Easy to fit, restricted by sample type |
Principal Component Analysis (PCA) | A method of multivariate statistical analysis in which several variables are transformed linearly to select a smaller number of important variables | Eliminate the mutual influence between indicators, small workload | The amount of information must be kept at a high level after dimensionality reduction, and the interpretation of principal components is usually fuzzy |
Study | Type | Device | Signals | Model | Result |
Can et al., 2019 [51] | Stress | Samsung Gear S, S2 and Empatica E4 wristband | End Diastolic Velocity, HRV and ACC | PCA, LDA, SVM, KNN, RF and LR | The accuracy of the multi-layer perceptron algorithm is 92.19%. |
Betti et al., 2018 [18] | Stress | Mobile EEG headset, Chest belt | HRV, EDA, EEG and Salivary cortisol | SVM | The classification algorithm based on salient features has an accuracy of 86%. |
Egilmez et al., 2017 [40] | Stress | LG smartwatch | GSR | RF | The pressure detection F-value of the system is 88.8%. |
Gjoreski et al., 2019 [39] | Stress | Empatica device's wristband | HR, BVP, IBI, ST and ACC | SVM | Experiments with 55 days of real data showed that the method detected stress events with 95% accuracy. |
Ahn et al., 2019 [52] | Stress | Head-mounted electrode | ECG, EEG | SVM | The accuracy of the classification of pressure states using SVM technology and the 5-fold cross validation of EEG and HRV features was 87.5%. |
Wu et al., 2019 [50] | Stress | Smart shirt | Cortisol, HRV | LR | Mental stress index is sensitive to acute stress and can predict the level of association between normal individuals and the stress group with about 97% accuracy. |
Jesmin et al., 2020 [53] | Stress | Smartwatch | HRV, GSR and SpO2 | SVM, KNN | Multi-sensor data fusion can better monitor individual stress levels. |
Bobade et al., 2020 [41] |
Stress | Chest-worn device, RespiBAN Professional and a wrist-worn device Empatica E4 | ACC, ECG, BVP, ST, RR, EMG and EDA | Machine Learning |
Using machine learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 81.65 and 93.20%, respectively, and using deep learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 84.32 and 95.21%, respectively. |
Silva et al., 2020 [49] | Stress | Smartwatch | HRV | ANN | Several parameters of the HRV under pressure were significantly different from baseline. |
Kim et al., 2020 [42] | Stress | Empatica E4 wristband | GSR | LR | The overall classification accuracy of the model was 85.3%, and the cross-validation accuracy was 83.2%. |
Han et al., 2020 [48] | Stress | Smart wrist band, Empatica E4 wristband and a Shimmer ECG device | ECG, PPG and GSR | KNN, SVM, and BN | The accuracy of 10-fold cross-validation was 94.55%, while the accuracy of subject cross-validation was 85.71%. In daily settings, the system had 81.82% accuracy in assessing stress. |
Tonacci et al., 2019 [62] | Stress | Shimmer™, Shimmer3™ | GSR, ECG | Pan Tompkins algorithm | HR increased with both sensory (olfactory) and cognitive stimulation, and there was a similar decrease in HRV. |
Tiwari et al., 2019 [43] | Stress and anxiety |
OMsignal smart-shirt and a Fitbit Charge 2 smart-bracelet | HRV | SVM | Overall, HRV characteristics calculated from wearable vest data (with higher temporal resolution) proved to be more effective in predicting stress and anxiety. |
Patlar Akbulut et al., 2020 [47] | Stress and anxiety |
Wrist-worn wearable device | ECG, GSR, ST, SpO2, Blood sugar and BP | PCA, Feed-forward neural network | Method had an average accuracy of 92 and 89% in distinguishing the stress levels of the experimental group and the control group, respectively. |
Sano et al., 2018 [54] | Stress and mental health condition | Q-sensor and Motion logger | SC, ST, ACC and Motion logger | SVM, LASSO and RBF | The sensors were 78.3% accurate in classifying students as high or low stress groups, and 87% accurate in classifying students as high or low mental health groups. |
Fraiwan et al., 2018 [65] | Mental health condition | Smart glove | ST and HRV | HRV algorithm | There was significant variability in skin resistance and HRV under relaxation and stimulation. |
Chen et al., 2018 [68] | Mental health condition | Smartwatch | Speech | Transfer learning | The use of wearable long-term speech-to-social data sets to categorize various sound scenes in unconstrained and natural environments yielded good results. |
Wen et al., 2020 [44] | Anxiety | The MP150 multi-channel physiological recorder | HR (ECG electrodes) | SVM | The classifier achieved 81.82% accuracy in detecting a high anxiety state. |
O'Brien et al., 2017[56] | Depression |
Wrist-mounted device |
ACC | LR | General physical activity was significantly reduced in depressed older adults compared with healthy controls. |
Narziev et al., 2020 [67] | Depression |
Smartwatch | Motion logger and HR | Machine learning |
Participants' ecological transient assessment self-report and passive perception (sensor data) were highly correlated between physical activity, mood and sleep levels; Depression detector verified the feasibility of grouping and classification with an accuracy rate of 96.00% (StDev = 2.76). |
Zanella-Calzada et al., 2019 [45] | Depression |
Actiwatch | Motion Logger | RF | Patients with depression were accurately detected in 86.7% of the cases, while those without depression were found in 91.9% of the cases. |
Gentili et al., 2017 [55] | Bipolar disorder | Sensorized t-shirt | HRV, respiration and ACC | SVM | After comparing the HRV features of the emotional states before and after, the classification accuracy of the SVM in the target state was significantly improved. |
Zheng et al., 2019 [46] | Emotion recognition |
Head/wearable eye-tracking glasses, EmotionMeter hardware | EEG and Eye-movement signal | Multi-mode deep neural network model | The modal fusion performance of the multi-mode deep neural network was significantly improved, and the average accuracy of the four emotions (happiness, sadness, fear and neutral) reached 85.11%. |
Goumopoulos et al., 2022 [19] | Mental fatigue | Zephyr HXM-BT, Chest-strap built-in ECG sensor | HRV | SVM, KNN and LR | Mental fatigue caused by cognitive load can be detected by wearable commercial devices and a single biomarker. |
Qiu et al., 2020 [38] | Mental fatigue | Head-mounted devices | ECG, EEG and ST | Wavelet transform, Kalman filtering | Through the fatigue evaluation experiment of 10 healthy subjects, it was verified that the device can reliably detect the mental fatigue state by monitoring and analyzing EEG, ECG and proximal ST data. |
References | Feature | Description |
[19,39,40,41,42,43,44,45,46,47,48,49,50,51] | Mean | Mean=∑is2i/n |
[19,39,40,41,42,43,44,45,46,47,49,50,51] | StDev | sd=√N∑i=1(xi−μ)2/(N−1) |
[40,45] | Skew | G1=n(n−1)(n−2)n∑i=1(xi−¯xs)3 |
[19,40,43] | RMS | rmΔ(s)=√∑is2j/x |
[39,40,45,51] | Kurtosis | K=μ/σ |
[39,40,45] | Quart1 | Median of lower 25% of the values |
[39,40,45] | Quart3 | Median of up 25% of the values |
[39,40,45,51] | IRQ | Difference between Quart1 and Quart3 |
[41,42,44] | Sum | S=N∑i=1wi |
[45] | Variance | sd2=1/nn∑i=1(xi−μ)2 |
[43] | Coefficient of variation | CV=sd/μ |
[45] | Inverse coefficient of variation | ICV=μ/sd |
[18,19,41,43,46,47,50,51,52] | Frequency | Peak frequency, low-frequency power (LF), high-frequency power (HF), very-low-frequency power (VLF), prevalent low frequency (pLF), prevalent high frequency (pHF), the ratio of LF to HF (LF/HF), (From Lomb–Scargle) LF, HF and LF/HF |
Machine learning algorithms | Definition | Advantages | Disadvantages |
Support Vector Machine (SVM) | In order to better find the hyperplane that can divide the feature space, the low-dimensional data are mapped to the high-dimensional plane | Works well with any complex problem, can scale high-dimensional data, risk of overfitting is lower; It shows high robustness and strong generation ability with low overfitting | Sensitive to missing data; It is difficult to implement large-scale training samples; There are difficulties in solving the multi-classification problem |
K-Nearest Neighbor (KNN) | In the feature space, if k samples near a sample belong to a category, that sample also belongs to that category | Simple thought, mature theory; Can be used for multiple classification, high accuracy, no assumptions on the data, not sensitive to outliers | Poor generalization ability and slow prediction speed |
Random Forests (RF) | A classifier for multiple decision trees | It can handle high-dimension data, has strong model generalization ability, and is fast in training. It can balance errors in unbalanced data sets, maintain accuracy if a large part of features is lost | In noisy classification or regression problems, overfitting and attribute weights produced based on data of different attribute values are not credible |
Artificial Neural Network (ANN) | A mathematical model for distributed parallel information processing that mimics the features of human neural network | It has a high accuracy of classification, strong parallel distribution processing ability, strong distribution storage and learning ability, adaptability, strong robustness and fault tolerance to noise nerve | Many parameters are required, which are not suitable for small samples, high calculation cost, long learning process, and intermediate results cannot be observed |
Logical Regression (LR) | The result of linear regression is mapped between 0 and 1 by a sigmoid function. The result of the mapping is the probability that the data sample point belongs to a certain category | The computational cost is low, the implementation is easy to understand, the robustness to small noise in the data is good, and it is not particularly affected by slight multicollinearity | It is easy to underfit, and the performance effect is not good when the classification accuracy is not high, the data features are missing or the feature space is large |
Decision Tree (DT) | A prediction model is a mapping between object attributes and object values | Easy to understand and explain, visualization of tree structure, less data required for training, fast analysis speed, and strong ability to process samples with missing attributes | Overfitting, poor generalization ability and unstable calculation results |
Bayesian Network (BN) | Graphical networks based on probabilistic inference | The process is simple and fast, efficient for multi-classification problems, and requires a smaller sample size | Sample attributes do not work well when they are associated and are sensitive to input forms |
Linear Discriminant Analysis (LDA) | The labeled data is projected to a lower dimensional space so that the points after projection are distinguished by category. Points of the same category are closer to each other in the space after projection, and the differences are as far as possible | It can reflect the differences between samples and use the prior knowledge experience of categories in dimension reduction | Easy to fit, restricted by sample type |
Principal Component Analysis (PCA) | A method of multivariate statistical analysis in which several variables are transformed linearly to select a smaller number of important variables | Eliminate the mutual influence between indicators, small workload | The amount of information must be kept at a high level after dimensionality reduction, and the interpretation of principal components is usually fuzzy |
Study | Type | Device | Signals | Model | Result |
Can et al., 2019 [51] | Stress | Samsung Gear S, S2 and Empatica E4 wristband | End Diastolic Velocity, HRV and ACC | PCA, LDA, SVM, KNN, RF and LR | The accuracy of the multi-layer perceptron algorithm is 92.19%. |
Betti et al., 2018 [18] | Stress | Mobile EEG headset, Chest belt | HRV, EDA, EEG and Salivary cortisol | SVM | The classification algorithm based on salient features has an accuracy of 86%. |
Egilmez et al., 2017 [40] | Stress | LG smartwatch | GSR | RF | The pressure detection F-value of the system is 88.8%. |
Gjoreski et al., 2019 [39] | Stress | Empatica device's wristband | HR, BVP, IBI, ST and ACC | SVM | Experiments with 55 days of real data showed that the method detected stress events with 95% accuracy. |
Ahn et al., 2019 [52] | Stress | Head-mounted electrode | ECG, EEG | SVM | The accuracy of the classification of pressure states using SVM technology and the 5-fold cross validation of EEG and HRV features was 87.5%. |
Wu et al., 2019 [50] | Stress | Smart shirt | Cortisol, HRV | LR | Mental stress index is sensitive to acute stress and can predict the level of association between normal individuals and the stress group with about 97% accuracy. |
Jesmin et al., 2020 [53] | Stress | Smartwatch | HRV, GSR and SpO2 | SVM, KNN | Multi-sensor data fusion can better monitor individual stress levels. |
Bobade et al., 2020 [41] |
Stress | Chest-worn device, RespiBAN Professional and a wrist-worn device Empatica E4 | ACC, ECG, BVP, ST, RR, EMG and EDA | Machine Learning |
Using machine learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 81.65 and 93.20%, respectively, and using deep learning technology, the accuracy of Class Ⅲ and Class Ⅱ classification problems reached 84.32 and 95.21%, respectively. |
Silva et al., 2020 [49] | Stress | Smartwatch | HRV | ANN | Several parameters of the HRV under pressure were significantly different from baseline. |
Kim et al., 2020 [42] | Stress | Empatica E4 wristband | GSR | LR | The overall classification accuracy of the model was 85.3%, and the cross-validation accuracy was 83.2%. |
Han et al., 2020 [48] | Stress | Smart wrist band, Empatica E4 wristband and a Shimmer ECG device | ECG, PPG and GSR | KNN, SVM, and BN | The accuracy of 10-fold cross-validation was 94.55%, while the accuracy of subject cross-validation was 85.71%. In daily settings, the system had 81.82% accuracy in assessing stress. |
Tonacci et al., 2019 [62] | Stress | Shimmer™, Shimmer3™ | GSR, ECG | Pan Tompkins algorithm | HR increased with both sensory (olfactory) and cognitive stimulation, and there was a similar decrease in HRV. |
Tiwari et al., 2019 [43] | Stress and anxiety |
OMsignal smart-shirt and a Fitbit Charge 2 smart-bracelet | HRV | SVM | Overall, HRV characteristics calculated from wearable vest data (with higher temporal resolution) proved to be more effective in predicting stress and anxiety. |
Patlar Akbulut et al., 2020 [47] | Stress and anxiety |
Wrist-worn wearable device | ECG, GSR, ST, SpO2, Blood sugar and BP | PCA, Feed-forward neural network | Method had an average accuracy of 92 and 89% in distinguishing the stress levels of the experimental group and the control group, respectively. |
Sano et al., 2018 [54] | Stress and mental health condition | Q-sensor and Motion logger | SC, ST, ACC and Motion logger | SVM, LASSO and RBF | The sensors were 78.3% accurate in classifying students as high or low stress groups, and 87% accurate in classifying students as high or low mental health groups. |
Fraiwan et al., 2018 [65] | Mental health condition | Smart glove | ST and HRV | HRV algorithm | There was significant variability in skin resistance and HRV under relaxation and stimulation. |
Chen et al., 2018 [68] | Mental health condition | Smartwatch | Speech | Transfer learning | The use of wearable long-term speech-to-social data sets to categorize various sound scenes in unconstrained and natural environments yielded good results. |
Wen et al., 2020 [44] | Anxiety | The MP150 multi-channel physiological recorder | HR (ECG electrodes) | SVM | The classifier achieved 81.82% accuracy in detecting a high anxiety state. |
O'Brien et al., 2017[56] | Depression |
Wrist-mounted device |
ACC | LR | General physical activity was significantly reduced in depressed older adults compared with healthy controls. |
Narziev et al., 2020 [67] | Depression |
Smartwatch | Motion logger and HR | Machine learning |
Participants' ecological transient assessment self-report and passive perception (sensor data) were highly correlated between physical activity, mood and sleep levels; Depression detector verified the feasibility of grouping and classification with an accuracy rate of 96.00% (StDev = 2.76). |
Zanella-Calzada et al., 2019 [45] | Depression |
Actiwatch | Motion Logger | RF | Patients with depression were accurately detected in 86.7% of the cases, while those without depression were found in 91.9% of the cases. |
Gentili et al., 2017 [55] | Bipolar disorder | Sensorized t-shirt | HRV, respiration and ACC | SVM | After comparing the HRV features of the emotional states before and after, the classification accuracy of the SVM in the target state was significantly improved. |
Zheng et al., 2019 [46] | Emotion recognition |
Head/wearable eye-tracking glasses, EmotionMeter hardware | EEG and Eye-movement signal | Multi-mode deep neural network model | The modal fusion performance of the multi-mode deep neural network was significantly improved, and the average accuracy of the four emotions (happiness, sadness, fear and neutral) reached 85.11%. |
Goumopoulos et al., 2022 [19] | Mental fatigue | Zephyr HXM-BT, Chest-strap built-in ECG sensor | HRV | SVM, KNN and LR | Mental fatigue caused by cognitive load can be detected by wearable commercial devices and a single biomarker. |
Qiu et al., 2020 [38] | Mental fatigue | Head-mounted devices | ECG, EEG and ST | Wavelet transform, Kalman filtering | Through the fatigue evaluation experiment of 10 healthy subjects, it was verified that the device can reliably detect the mental fatigue state by monitoring and analyzing EEG, ECG and proximal ST data. |