Research article Special Issues

EMD-based analysis of complexity with dissociated EEG amplitude and frequency information: a data-driven robust tool -for Autism diagnosis- compared to multi-scale entropy approach

  • Objective: Autism spectrum disorder (ASD) is usually characterised by altered social skills, repetitive behaviours, and difficulties in verbal/nonverbal communication. It has been reported that electroencephalograms (EEGs) in ASD are characterised by atypical complexity. The most commonly applied method in studies of ASD EEG complexity is multiscale entropy (MSE), where the sample entropy is evaluated across several scales. However, the accuracy of MSE-based classifications between ASD and neurotypical EEG activities is poor owing to several shortcomings in scale extraction and length, the overlap between amplitude and frequency information, and sensitivity to frequency. The present study proposes a novel, nonlinear, non-stationary, adaptive, data-driven, and accurate method for the classification of ASD and neurotypical groups based on EEG complexity and entropy without the shortcomings of MSE. Approach: The proposed method is as follows: (a) each ASD and neurotypical EEG (122 subjects × 64 channels) is decomposed using empirical mode decomposition (EMD) to obtain the intrinsic components (intrinsic mode functions). (b) The extracted components are normalised through the direct quadrature procedure. (c) The Hilbert transforms of the components are computed. (d) The analytic counterparts of components (and normalised components) are found. (e) The instantaneous frequency function of each analytic normalised component is calculated. (f) The instantaneous amplitude function of each analytic component is calculated. (g) The Shannon entropy values of the instantaneous frequency and amplitude vectors are computed. (h) The entropy values are classified using a neural network (NN). (i) The achieved accuracy is compared to that obtained with MSE-based classification. (j) The consistency of the results of entropy 3D mapping with clinical data is assessed. Main results: The results demonstrate that the proposed method outperforms MSE (accuracy: 66.4%), with an accuracy of 93.5%. Moreover, the entropy 3D mapping results are more consistent with the available clinical data regarding brain topography in ASD. Significance: This study presents a more robust alternative to MSE, which can be used for accurate classification of ASD/neurotypical as well as for the examination of EEG entropy across brain zones in ASD.

    Citation: Enas Abdulhay, Maha Alafeef, Hikmat Hadoush, V. Venkataraman, N. Arunkumar. EMD-based analysis of complexity with dissociated EEG amplitude and frequency information: a data-driven robust tool -for Autism diagnosis- compared to multi-scale entropy approach[J]. Mathematical Biosciences and Engineering, 2022, 19(5): 5031-5054. doi: 10.3934/mbe.2022235

    Related Papers:

    [1] Kunli Zhang, Bin Hu, Feijie Zhou, Yu Song, Xu Zhao, Xiyang Huang . Graph-based structural knowledge-aware network for diagnosis assistant. Mathematical Biosciences and Engineering, 2022, 19(10): 10533-10549. doi: 10.3934/mbe.2022492
    [2] Zhi Yang, Kang Li, Haitao Gan, Zhongwei Huang, Ming Shi, Ran Zhou . An Alzheimer's Disease classification network based on MRI utilizing diffusion maps for multi-scale feature fusion in graph convolution. Mathematical Biosciences and Engineering, 2024, 21(1): 1554-1572. doi: 10.3934/mbe.2024067
    [3] Shi Liu, Kaiyang Li, Yaoying Wang, Tianyou Zhu, Jiwei Li, Zhenyu Chen . Knowledge graph embedding by fusing multimodal content via cross-modal learning. Mathematical Biosciences and Engineering, 2023, 20(8): 14180-14200. doi: 10.3934/mbe.2023634
    [4] Shuwei Zhu, Wenping Wang, Wei Fang, Meiji Cui . Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping. Mathematical Biosciences and Engineering, 2023, 20(12): 21098-21119. doi: 10.3934/mbe.2023933
    [5] Hanming Zhai, Xiaojun Lv, Zhiwen Hou, Xin Tong, Fanliang Bu . MLSFF: Multi-level structural features fusion for multi-modal knowledge graph completion. Mathematical Biosciences and Engineering, 2023, 20(8): 14096-14116. doi: 10.3934/mbe.2023630
    [6] Changwei Gong, Bing Xue, Changhong Jing, Chun-Hui He, Guo-Cheng Wu, Baiying Lei, Shuqiang Wang . Time-sequential graph adversarial learning for brain modularity community detection. Mathematical Biosciences and Engineering, 2022, 19(12): 13276-13293. doi: 10.3934/mbe.2022621
    [7] Yuanhong Jiang, Yiqing Shen, Yuguang Wang, Qiaoqiao Ding . Automatic recognition of white blood cell images with memory efficient superpixel metric GNN: SMGNN. Mathematical Biosciences and Engineering, 2024, 21(2): 2163-2188. doi: 10.3934/mbe.2024095
    [8] Hongqiang Zhu . A graph neural network-enhanced knowledge graph framework for intelligent analysis of policing cases. Mathematical Biosciences and Engineering, 2023, 20(7): 11585-11604. doi: 10.3934/mbe.2023514
    [9] Zijian Wang, Yaqin Zhu, Haibo Shi, Yanting Zhang, Cairong Yan . A 3D multiscale view convolutional neural network with attention for mental disease diagnosis on MRI images. Mathematical Biosciences and Engineering, 2021, 18(5): 6978-6994. doi: 10.3934/mbe.2021347
    [10] Zulin Xu . An intelligent fault detection approach for digital integrated circuits through graph neural networks. Mathematical Biosciences and Engineering, 2023, 20(6): 9992-10006. doi: 10.3934/mbe.2023438
  • Objective: Autism spectrum disorder (ASD) is usually characterised by altered social skills, repetitive behaviours, and difficulties in verbal/nonverbal communication. It has been reported that electroencephalograms (EEGs) in ASD are characterised by atypical complexity. The most commonly applied method in studies of ASD EEG complexity is multiscale entropy (MSE), where the sample entropy is evaluated across several scales. However, the accuracy of MSE-based classifications between ASD and neurotypical EEG activities is poor owing to several shortcomings in scale extraction and length, the overlap between amplitude and frequency information, and sensitivity to frequency. The present study proposes a novel, nonlinear, non-stationary, adaptive, data-driven, and accurate method for the classification of ASD and neurotypical groups based on EEG complexity and entropy without the shortcomings of MSE. Approach: The proposed method is as follows: (a) each ASD and neurotypical EEG (122 subjects × 64 channels) is decomposed using empirical mode decomposition (EMD) to obtain the intrinsic components (intrinsic mode functions). (b) The extracted components are normalised through the direct quadrature procedure. (c) The Hilbert transforms of the components are computed. (d) The analytic counterparts of components (and normalised components) are found. (e) The instantaneous frequency function of each analytic normalised component is calculated. (f) The instantaneous amplitude function of each analytic component is calculated. (g) The Shannon entropy values of the instantaneous frequency and amplitude vectors are computed. (h) The entropy values are classified using a neural network (NN). (i) The achieved accuracy is compared to that obtained with MSE-based classification. (j) The consistency of the results of entropy 3D mapping with clinical data is assessed. Main results: The results demonstrate that the proposed method outperforms MSE (accuracy: 66.4%), with an accuracy of 93.5%. Moreover, the entropy 3D mapping results are more consistent with the available clinical data regarding brain topography in ASD. Significance: This study presents a more robust alternative to MSE, which can be used for accurate classification of ASD/neurotypical as well as for the examination of EEG entropy across brain zones in ASD.



    Electronic health records (EHRs) are generated by hospitals to maintain real-time digital records of patients, including clinical events, diagnosis, and treatment information of patients during the outpatient and inpatient periods. EHRs are composed of different structured [1] and unstructured data, including demographic information, diagnosis information, laboratory results, medication prescriptions, medical imaging, and text descriptions, for example, as free clinical notes. Clinical assistant diagnosis is a fundamental task in smart healthcare, aiming to automatically extract valuable information from EHRs and make correct and intelligent diagnostic decisions, which improves the diagnostic efficiency of clinicians and reduces misdiagnosis. It is important in personalized medicine, disease prediction, and other tasks.

    Although EHRs have been widely studied and have greatly improved clinical decision support [2], there are still some challenges for clinical assistant diagnosis based on EHRs. On the one hand, EHRs contain multiple types of data such as population information and laboratory tests, which make EHRs highly dimensional and heterogeneous, and it is challenging to create large-scale computational models from EHRs. On the other hand, an EHR is incomplete and noisy, because some hospitals are still further standardizing the unified writing standard of EHRs and patient follow-up records [3].

    Recently, GNNs have been used extensively for clinical assistant diagnosis with significant research findings [1,2,3,4]. However, there are still three obstacles. First, most of the GNNs based on homogenous graph models focus on the co-occurrence of various medical concepts. Furthermore, although there are a few heterogeneous graph models based on EHRs, each relation in these models can represent only the local information of the heterogeneous graph. Combining different relationships forms a large amount of higher-order semantic information. Treating these heterogeneous relations uniformly is impossible, which limits the learning ability of GNNs. Finally, GNNs are highly sensitive to the original topology of information transmission in smart healthcare applications. An artificially designed heterogeneous graph model is usually extracted from structured and unstructured medical text data, which inevitably contain redundant, erroneous, and missing information. Consequently, it is difficult to deal with the challenges of structure learning and optimization in a heterogeneous graph model based on EHRs.

    We propose a new model, PM-GSL, which introduces graph structure learning into diabetes clinical assistant diagnosis, and abstractly describes the semantic relations between entities in EHRs through patient multi-relational graph. PM-GSL can not only learn the structure of heterograph, but also learn the parameters of GNNs to overcome the weak structure problem of patient multi-relational graph. The contributions of this study are summarized as follows:

    1) We apply graph structure learning to diabetes clinical assistant diagnosis and use EHRs to construct a patient multi-relational graph, which abstractly describes the relation between entities in EHRs from two perspectives: node attributes and meta-paths.

    2) We propose a novel diabetes clinical assistant diagnosis model, PM-GSL. In this model, we design three candidate graphs, original subgraph, overall feature graph, and higher-order semantic graph, and fuse the three candidate graphs to generate a higher quality heterogeneous graph to obtain the complex interactive higher-order semantic information of the patient multi-relational graph.

    3) We also conduct multiple experiments to compare the results with the existing state-of-the-art approach on two standard EHRs datasets: the large international public dataset MIMIC-Ⅳ and the Chinese hospital dataset P-EHRs. Our proposed PM-GSL model achieved a 9.62% improvement in Macro-F1, 9.17% improvement in Micro-F1 and 10.33% in AUC compared to the state-of-the-art models for diabetes clinical assistant diagnosis tasks.

    The remainder of this paper is organized as follows. Section 2 describes related work, where we discuss deep learning models, such as GNNs, disease assistant diagnosis, and graph structure learning, which motivate this work. Section 3 describes the problem setting of diabetes clinical assistant diagnosis and presents the related definitions. Section 4 details the PM-GSL model. Section 5 demonstrates the results of our empirical evaluation of the PM-GSL on the MIMIC-Ⅳ and P-EHRs datasets. Finally, Section 6 provides concluding remarks and research directions.

    GNNs were first proposed by Scarselli et al. [5] to further extend deep learning to graph-structured data, which mainly contains two types: spectral domain [6,7] and spatial domain [8,9]. The method based on the spectral domain adopts the spectral representation of the graph [10]. The spatial domain-based methods define convolution directly on the graph and aggregate feature information for each node from the spatial neighborhood, such as GraphSAGE [11] and GAT [12]. However, the above graph neural network can only be used to process a homogeneous graph. Some recent studies have attempted to extend GNNs to heterogeneous graphs, such as metapath2vec [13], which transforms heterogeneous graphs into homogeneous graph studies using random wandering based on meta-paths to learn graph representations.

    The HAN [14] model utilizes the graph attention network method for the modeling and analysis of a heterogeneous graph, aggregating attribute information from meta-path-based neighbor nodes. Based on the HAN, the MAGNN [15] model combines the internal aggregation of meta-paths and aggregation between meta-paths. GNNs have powerful capabilities in learning node embedding representations and have achieved significant performance in numerous specific tasks [16]. However, almost all GNNs regard observed data containing noise or hypothetical data that are convenient for modeling as real information, which significantly depends on the quality of the original graph structure, greatly limiting their ability to handle the uncertainty in the graph structure.

    Graph structure learning (GSL) can be traced back to several research achievements devoted to graph structure learning in network science [17,18]. The GSL handles the noise and incompleteness problems in the original graph data. Some existing methods learn graph structure from measurements of graph dynamical systems, such as coupled oscillators [19], and there are also studies [10] that set up attribute completion strategies for attribute missing problems. Nevertheless, the goal of these findings is different from graph representation learning. Recent studies have attempted to combine GSL with GNNs to improve the performance of downstream tasks. Franceschi et al. proposed a new method, LDS [20], to jointly learn and optimize the original graph and GNN parameters by approximating the solution.

    The Pro-GNN [21] aims to extract the graph properties of sparsity, low rank and feature smoothness to design more robust GNNs that learn clean and efficient graph structures. The GTN [22] model can generate a new graph structure by identifying valuable edges that may exist as unconnected nodes on the original graph and can learn the effective embedded representation of nodes on the new graph from end to end. However, most GSL models are designed for homogeneous and smaller graphs. They modify the graph structure by removing noisy data from the graph topology and graph attribute similarity. Such models tend to optimize the entire graph [23], and it is difficult to analyze large-scale graph data with complex structures and rich semantics in the medical field.

    Graph-based methods have been widely used in auxiliary disease diagnosis tasks because of the powerful expression ability of graphs in relation modeling. In [3], a comprehensive disease self-diagnosis system based on the HealGCN model was proposed, which uses the predefined meta-paths inductive graph convolution operation to generate the embedded representation of new patients and solve the cold start problem by mining the complex interactions in EHRs data. The MVS-GCN [24] model, based on a graph neural network, learns the effective representation of the brain network in an end-to-end manner and combines graph structure learning and multitask graph embedding representation learning, thus improving the classification performance of brain disease diagnosis.

    MM-STGNN [25] is a multimodal spatiotemporal graph neural network that uses graphs to represent the topological relationship between inpatients and predicts 30-day readmission according to patients' longitudinal imaging and EHRs data. In [26], the authors proposed an end-to-end multi-modal graph learning framework (MMGL) for disease prediction with multi-modality. And it aggregates the features of each modality by leveraging the correlation and complementarity between the modalities. Although GNN models have shown excellent predictive performance in numerous healthcare tasks, they are highly sensitive to the quality of graph structures. If a manually constructed graph is directly used in the GNNs and separated from the prediction module, it will lead to cumbersome tuning and poor generalization [27]. Simultaneously, these models failed to further explore the complementary information between multiple relations in the graph and the message propagation mechanism in the multi-relational subgraph.

    Diabetes, glaucoma and central nervous system diseases (such as stroke) are common in middle-aged and elderly people. Their early symptoms are extremely similar and are difficult to distinguish clinically [28]. For example, in a diabetes patient with a wide range of blood glucose fluctuations, the osmotic pressure inside and outside the lens changes, causing a change in the curvature of the lens and affecting the patient's ability to focus the eye, thus producing blurred vision symptoms. Although blurred vision symptoms may be due to diabetes, glaucoma can also cause eye diopter changes in patients, resulting in lower intraocular pressure and causing blurred vision. In addition, stroke patients often show symptoms, such as blurred vision, sudden pins and needle sensations, and fatigue, which are similar to those of diabetic patients. This differs from many previous studies that modeled disease diagnosis as a link-prediction task. We use patient demographics, laboratory tests, physiological records, and medications in EHRs as nodes in the multi-relational graph to transform the disease prediction task into a classification task with fine-grained analysis.

    The patient multi-relational graph is defined as G=(V,E,X,{ξγ}Rr=1), where V is the node set {v1,v2,...vN}. We let N denote the number of nodes, and the corresponding node type is M. E is the set of edges, all edges form a primitive adjacency matrix XRN×N, where Xij indicates whether there is an edge between nodes vi and vj, edge relation type is ξγ, r{1,2...,R}, and the multi-relational graph satisfies the inequality |M|+|R|>2. The eigenvector of each node is fiRd, and all the node feature matrices are represented as X={f1,f2...fN}RN×d.

    In the patient multi-relational graph, two nodes can be connected by different semantic patterns. The meta-path ϕ is formalized as v1r1v2r2...rl1vl, which is used to describe the composite relation r1r2...rl1 between nodes v1 and vl, where is a relational synthesis operation.

    The patient multi-relational subgraph Gr contains all node-pair triples with relation r, which is formalized as (vi,r,vj). The adjacency matrix of a multi-relational subgraph is expressed as Ar, and if (vi,r,vj)Gr, then Arij=1; otherwise, Arij=0. A={Ar,rR} represents the set of all patient multi-relational subgraphs. For example, we define the type mapping functions ϖh(r) and ϖe(r) of the node to the triplet header and tail node in a patient medical record network. If r= "pt" is interpreted as ϖh(r)= "patient, " ϖe(r)= "test." The adjacency matrix of the patient multi-relational subgraph is expressed as ArR|ϖh(r)|×|ϖe(r)|.

    In this study, the EHRs are modeled as a patient multi-relational graph, where each patient is considered a node, and their EHRs data are associated with multiple medical concepts. For example, the following information is extracted from a record in the EHRs: if a patient's fasting blood glucose exceeded 7.0 mmol/L or postprandial blood glucose exceeded 11.1 mmol/L in the glucose tolerance test, the doctor identified abnormal blood glucose in the diagnosis conclusion, of which "fasting blood glucose" and "postprandial blood glucose" are medical concepts. We reduce the values corresponding to similar or identical medical concepts to smaller categories and set the nodes "normal" and "abnormal" for the patient's laboratory tests. Based on [28], patients' ages were grouped according to three thresholds of 15, 30 and 64, set as four nodes of representation. Over 100 medications are extracted from EHRs, which are divided into 20 types according to different abbreviations, formulations and their effects, with each type of medication considered as a node. The comorbidity diagnoses of the patients matched the appropriate ICD-10 codes.

    In the diabetes diagnosis task, gender is not the most informative patient attribute, which is not conducive to the prediction task and may cause the problem of "over smoothing." According to the conclusion of [4], the PM-GSL model disregards gender attributes. We construct the multi-relational graph using different relations between five types of nodes, "Patient, " "Age, " "Medication, " "Diagnoses" and " Laboratory Tests, " and determine semantically meaningful meta-paths. For example, two patients with blurred vision, "patient1 metformin patient2, " indicate that two patients had the same medication, and two patients were likely to have the same diagnosis. The meta-paths in Table 1 guide the random walk of the patient multi-relational graph. We propose the PM-GSL, a diabetes clinical assistant diagnosis model for patient multi-relational graph structure learning, as shown in Figure 1. First, we analyze the graph connectivity based on the PM-GSL to jointly train and optimize the patient multi-relational graph structure and GNNs parameters. Second, a semantic embedding matrix is constructed based on the node embedding method of meta-paths. Considering rn as an example, we use the PM-GSL model to obtain a new overall feature graph ΓFrn and a higher-order semantic graph ΓSrn and fuse these two graphs with the original subgraph Arn to obtain a new subgraph Arn. Finally, the learned subgraphs are input into the GNN and regularizer to output the diagnosis and prediction of diabetes.

    Table 1.  Meta-paths in the patient multi-relational graph.
    Node types Semantic Meta-paths
    Patient (P) Two patients cite the same medical concept. Patient Age Patient (PAP)
    Age (A) Patient Medicine Patient (PMP)
    Medication (M) Patient Diagnosis Patient (PDP)
    Diagnosis (D) Patient Test Patient (PTP)
    Laboratory Test (T)

     | Show Table
    DownLoad: CSV
    Figure 1.  PM-GSL framework.

    From the graph representation learning perspective, GNNs aim to learn node embedding by aggregating the information of neighboring nodes [29]. This iterative mechanism relies on the inherent information in the graph structure and attributes. However, artificially designed and constructed graphs have inevitable redundant noise and misinformation, which limits the predictive performance and interpretability of most GNNs, especially in smart healthcare, which requires higher graph quality. To optimize the patient multi-relational graph, the PM-GSL aims to fully use node features to enhance the original graph structure, which is mainly reflected in two aspects. First, we generate a feature similarity graph based on the similarity between node features. Then, node features are propagated to specific domains in the multi-relational graph to generate a message propagation graph, as shown in Figure 2.

    Figure 2.  Message propagation graph.

    The pairwise connection of nodes is one of the most direct ways to represent information in the patient multi-relational graph. To preserve the pairwise proximity attribute between nodes in the patient multi-relational graph, the PM-GSL model determines the possibility of edges between two nodes with relation type rnR based on node features.

    Specifically, for a node vi with node type φ(vi) and node eigenvector fiR1d, the node features are projected into the d-dimensional common space using a type-specific mapping matrix Wφ(vi), which is denoted as fiR1×d:

    fi=σ(Wφ(vi)fi+bφ(vi)) (4.1)

    σ() denotes the non-linear activation function, WRd×d denotes the mapping matrix, and bR1×d denotes the bias vector. In the patient multi-relational graph, for relation rn, we use weighted cosine similarity as the measurement function to measure node similarity and obtain a feature similarity graph ΓFSrnR|ϖh(rn)|×|ϖe(rn)|. The similarity between nodes vi and vj is expressed as follows:

    δFSrn(fvi,fvj)=1KKkcos(QFSk,rnfvi,QFSk,rnfvj) (4.2)

    where fvi and fvj are the node features after projection, and QFSrn is the learnable parameter matrix. After the calculation, the node feature similarity matrix ΓFSrn is generated, and is symmetric, and the element range of the matrix is [-1, 1]. Therefore, we must extract a non-negative and sparse matrix from ΓFSrn. Specifically, we defined a non-negative threshold λFS for automatic learning and set the elements that are smaller than λFS to 0. If λFS is larger, then ΓFSrn will be sparser, defined as follows:

    ΓFSrn(vi,vj)={δFSrn(fvi,fvj)    δFSrn(fvi,fvj)λFS0                            otherwise (4.3)

    The message propagation graph captures the interaction between the node features and topology, and it can find new effective edges more accurately. The key idea of the message propagation graph includes two aspects: First, many nodes of the same class have similar features and may be far apart in the feature space in the patient multi-relational graph structure data. Second, two nodes with similar node features are more likely to have similar neighbor nodes, and the feature similarity graph is propagated through the topology to generate new edges.

    The node types corresponding to the two nodes that satisfy the relation rn in the patient multi-relational graph are ϖh(rn) and ϖe(rn), and the topology between them is ArnR|ϖh(rn)|×|ϖe(rn)|. For two nodes vt,vsϖh(rn), we calculate the feature similarity:

    δFHrn(fvt,fvs)=cos(QFHrnfvt,QFHrnfvs) (4.4)

    Similar to Eq (4.3), a threshold λFH is set to control the sparsity of the feature similarity graph ΓFHrn, as shown in Eq (4.5). After obtaining the similarity of vt,vsϖh(rn), a potential message propagation graph ΓCHrnR|ϖh(rn)|×|ϖe(rn)| is generated based on the feature similarity graph ΓFHrn and original subgraph Arn, as shown in Eq (4.6):

    ΓFHrn(vt,vs)={δFHrn(fvt,fvs)         δFHrn(fvt,fvs)λFH0                               otherwise (4.5)
    ΓCHrn=ΓFHrnArn (4.6)

    Similarly, for the same type of vp,vqϖe(rn), we calculate Eqs (4.4) and (4.5) to obtain another corresponding feature similarity graph ΓFTrn and the message propagation graph ΓCTrnR|ϖh(rn)|×|ϖe(rn)|:

    ΓCTrn=ArnΓFTrn (4.7)

    In summary, we fuse the generated feature similarity graph ΓFSrn and the message propagation graphs ΓCHrn and ΓCTrn through the channel attention layer [21] to obtain the overall feature graph ΓFrnR|ϖh(rn)|×|ϖe(rn)| of relation rn:

    ΓFrn = ΨFrn([ΓFSrn,ΓCHrn,ΓCTrn]) (4.8)

    where [ΓFSrn,ΓCHrn,ΓCTrn]R|ϖh(rn)|×|ϖe(rn)|×3 denotes the stacking matrix of the candidate graphs that generate the overall feature graph. ΨFrn represents the channel attention layer, and the corresponding parameter is QFrnR1×1×3. The channel attention layer uses the function softmax(QFrn) to perform 1×1 convolution operations on the input. In this way, the PM-GSL model learns the different weights of the three candidate graphs of the overall feature graph respectively to measure the importance of each candidate graph to relation rn.

    The semantic graph relies on the higher-order topology of the heterogeneous information network, which aims to describe the multi-hop structural interaction between two nodes determined by meta-paths. In the patient multi-relational graph, the overall feature graph describes the nodes from a microscopic perspective, whereas the higher-order semantic graph describes the nodes from a macroscopic perspective. The higher-order relations reflected by different meta-paths contain different semantic information. We obtain the higher-order semantic graph structure of the patient multi-relational graph from multiple meta-paths, as shown in Figure 3. Given a sequence r1,r2,...,rn of the meta-path ϕ, a common method to generate a higher-order semantic graph is to multiply multiple adjacency matrices, such as the adjacency matrix Aϕ=Arn...Ar2Ar1. However, this method of obtaining high-order semantic graph by multiplying multiple adjacency matrices requires a considerable amount of time and space, and it discards the information of intermediate semantic nodes in the meta-paths.

    Figure 3.  Higher-order semantic graph.

    In the patient multi-relational graph, a set of meta-paths {ϕ1,ϕ2,...,ϕm} exists. Based on metapath2vec [12], we obtained the meta-paths based node semantic embedding representation, Z={Zϕ1,Zϕ2,...,Zϕm}, to learn the potential higher-order semantic graph structure of the patient multi-relational graph. The entire training process adopts the meta-paths guidance random walk strategy and heterogeneous skip-gram model learning node embedding representation, which significantly retains the information of intermediate semantic nodes [30]. In addition, the training process of the metapath2vec model does not involve adjacency matrix multiplication, which reduces the time and space complexity.

    After obtaining the node semantic embedding representation Z, for each meta-path ϕm in the patient multi-relational graph, we generate the adjacency matrix ΓMSrn,ϕmR|ϖh(rn)|×|ϖe(rn)|, for the semantic subgraph, and the calculation method for each edge is

    ΓMSrn,ϕm(i,j)={δMSrn,ϕm(zmi,zmj) = cos(QMSrnzmi,QMSrnzmj)      δMSrn,ϕm(zmi,zmj)λMS0                            otherwise  (4.9)

    where zmi denotes the i-th row of Zϕm. δMSrn,ϕm is a metric function with parameter QMSrn. We extend this calculation method to all meta-paths to generate m semantic subgraphs and aggregate them to obtain a higher-order semantic graph ΓSrn:

    ΓSrn=ΨMSrn([ΓMSrn,ϕ1,ΓMSrn,ϕ2,...,ΓMSrn,ϕm]) (4.10)

    where [ΓMSrn,ϕ1,ΓMSrn,ϕ2,...,ΓMSrn,ϕm] is the stacking matrix of the corresponding m semantic subgraphs. ΨMSrn represents the channel attention layer, and the corresponding weight matrix QMSrnR1×1×m denotes the importance of semantic subgraphs based on different meta-paths of relation rn. After obtaining ΓSrn, we need to aggregate the learned overall feature graph ΓFrn and higher-order semantic graph ΓSrn, with the original patient subgraph Arn to generate a new relational subgraph Arn of relation rn:

    Arn = Ψrn([ΓFrn,ΓSrn,Arn]) (4.11)

    Similarly, [ΓFrn,ΓSrn,Arn] is the stacking matrix of the generated new relational adjacency matrix. Weight QrnR1×1×3 of candidate graphs is learned from the channel attention layer Ψrn, to evaluate the importance of the three candidate graphs in generating the relational subgraph Arn. Finally, we obtain a new patient multi-relational graph A={Arn,rnR} in the PM-GSL model.

    In the PM-GSL model, we combine the GNN parameters to optimize the new graph structure A. Considering that the adjacency matrix of the original heterogeneous graph may provide misleading information for the aggregation process, we aggregate the feature representation of neighbor nodes through the obtained patient multi-relational graph structure and apply it to diabetes clinical assistant diagnosis. For the patient multi-relational graph structure A, the forward model of the two-layer GCN can be expressed as follows:

    Y = f(X,F)=softmax(ˆAReLU(ˆAXW0)W1) (4.12)

    where XRN×d is the original feature matrix of the node and the input to the first GCN layer. F is an adjacency matrix constructed using A. ˆA=˜D12˜A˜D12, ˜Dii=1+jFij, ˜A=F+IN, and IN are unit matrices. The classification loss on the graph is expressed as follows:

    LGNN=viVl(Yvi,yvi) (4.13)

    where Yvi is the prediction label, and l(,) indicates the cross-entropy loss function, which measures the error between the prediction label Yvi and the real label yvi.

    The patient multi-relational graph constructed from multiple pieces of information in real-world EHRs is noisy and has task-independent edges. We propose the PM-GSL model, which makes the original patient multi-relational graph more adaptable to the prediction task but more prone to overfitting problems. Therefore, we impose sparsity constraints on the adjacency matrix F of the graph A and apply the regularization term to the learned graph as follows:

    LREG=β||F||1 (4.14)

    The total loss of the PM-GSL model is described by Eq (4.15). Minimizing L can enable the joint optimization of the graph structure and GNN parameters to improve the prediction task performance.

    L = LGNN+LREG (4.15)

    We use two real EHRs datasets to evaluate the PM-GSL model: one from American hospitals and another from Chinese hospitals. Then, we compare the PM-GSL model with state-of-the-art methods. Specifically, we aim to answer the following four questions:

    RQ1: Does the PM-GSL model improve the accuracy of a diabetes diagnosis?

    RQ2: How important are the three types of candidate graphs in the PM-GSL model for generating new graph structures?

    RQ3: Does PM-GSL adaptively assign greater channel attention values to important information?

    RQ4: Is graph structure learning effective in improving the quality of the patient multi-relational graph?

    Our work is tested and analyzed by using two datasets. MIMIC-Ⅳ is a large international public EHRs dataset that collected clinical data of over 380,000 patients from 2008 to 2019. This dataset was collected at the Beth Israel Deaconess Medical Center in Boston, Massachusetts, USA, and included patient centered clinical records, such as demographics, vital sign measurements, nursing notes, and laboratory tests.

    Furthermore, we also use the EHRs of a tertiary care hospital located in a major metropolitan center in northwestern China called P-EHRs. Although the amount and completeness of the data are relatively small, the information included in this study is sufficient to support our work. More specifically, there was 1) demographic information, such as age and sex. 2) Diagnosis: The combination group is defined according to the coding information of ICD-9 and ICD-10, mainly including 14 diseases, such as myocardial infarction, congestive heart failure, hyperlipidemia, hypertension, diabetes, and chronic lung disease. 3) Laboratory tests: We extracted the laboratory test indicators obtained for the first time and the minimum, mean, and maximum values of laboratory tests during hospitalization. 4) Vital signs: We selected the minimum, mean, and maximum values of the patient's vital signs during the first day of admission, which included heart rate, systolic blood pressure, diastolic blood pressure, mean arterial pressure, respiratory rate, and percutaneous oxygen saturation. 5) Medications: We collected data on drugs that were used more than 300 times in total during the patients' hospitalization and classified them into 20 categories according to the functions of these drugs. Over all, we briefly summarize in Tables 2 and 3.

    Table 2.  Node statistics of the patient multi-relational graph.
    Dataset MIMIC-Ⅳ P-EHRs
    Quantity/Types Nodes Quantity/Types Nodes
    Patient 6000/3 6000 2000/3 2000
    Age 4 4 4 4
    Diagnosis 14 14 6 6
    Medication 101 20 65 13
    Laboratory test 38 76 26 52
    Vital sign 6 12 6 12

     | Show Table
    DownLoad: CSV
    Table 3.  Statistics of the datasets.
    Datasets Nodes Edges Edge types Features Training Validation Test
    MIMIC-Ⅳ 6122 118,717 4 107 2700 1200 2100
    P-EHRs 2083 29,572 4 76 1700 400 900

     | Show Table
    DownLoad: CSV

    The experiment is based on Python 3.6.8 and TensorFlow 2.3.0. The calculation of a single model runs on NVIDIA RTX 6000 GPU. According to [30], in all the methods involved in this study, the node embedding dimension d and the common space dimension d were set to 64 and 16, respectively. Equation (4.2) in Section 4 represents a 2-head cosine similarity measure function, that is, K = 2. The learning rate and weight decay are set to 0.01 and 0.0005, respectively. The other hyperparameters are adjusted by using the grid search method, which is reflected in λFS, λFH, λMS and β for Eqs (4.3), (4.5), (4.9), and (4.14).

    We compare the PM-GSL with six state-of-the-art baseline graph neural network models in recent years. They include two homogeneous graphs (GCN [7] and GAT [12]), two heterogeneous graphs (metapath2vec [13] and HAN [14]), and two graph structure learning methods (LDS [20] and Pro-GNN [21]). We conduct six ablation experiments to verify the effectiveness of the three candidate graphs in the PM-GSL for diabetes diagnosis.

    In this section, we compare the overall performance of the PM-GSL and six baseline models, as shown in Figure 4. The classification process was repeated ten times to obtain the average value and ensure more stable and reliable prediction results. We use the evaluation metrics Macro-F1, Micro-F1, and AUC, which are commonly used in classification tasks.

    Figure 4.  Performance evaluation of diabetes diagnosis.

    The experimental findings are as follows:

    (ⅰ) The PM-GSL model consistently outperforms the three types of baselines in predicting both EHRs datasets, which not only indicates that noisy data in the original EHRs prevent the GNN from aggregating effective feature information but also proves that PM-GSL can obtain a higher-quality heterogeneous graph structure.

    (ⅱ) Compared with GCN and GAT, the performance of PM-GSL on the two EHRs datasets was significantly improved by 13.36–17.59%, which confirms that the fine-grained division of the patient multi-relational graph is extremely conducive to the diabetes diagnosis.

    (ⅲ) Although the metapath2vec and HAN models can be used to analyze heterogeneous relations, the prediction performance of these two models is 9.49–14.39% lower than that of the PM-GSL model proposed in this study. This is because the density of neighbor nodes in each relation in the patient multi relation graph is high. In particular, the HAN model directly uses the original graph structure as the input and cannot effectively filter out neighbor nodes with interference factors, which causes difficulties in diabetes diagnosis.

    (ⅳ) Compared with the LDS model and Pro-GNN model, the PM-GSL model proposed in this study can improve the prediction performance by 7.26–9.00%. In particular, the PM-GSL model can calculate node similarity and message propagation path and can make full use of multiple patient relations, which is helpful for learning better graph structure and more robust GNN parameters; thus, the new patient multi-relational graph has a stronger ability to adapt to prediction tasks.

    To further verify the impact of the three core candidate graphs in the PM-GSL model, we designed three variants of the PM-GSL model by deleting any type of candidate graphs, which are respectively represented as PM-GSLFS, PM-GSLHT and PM-GSLS*. Table 4 shows the results of these variants in terms of the Macro-F1, Micro-F1 and AUC, and we can observe the following:

    Table 4.  PM-GSL ablation experiment.
    Dataset MIMIC-Ⅳ P-EHRs
    Macro-F1 Micro-F1 AUC Macro-F1 Micro-F1 AUC
    PM-GSLFS 0.7714 0.7917 0.8538 0.7167 0.7045 0.7489
    L-PM-GSLHT 0.8041 0.7933 0.8678 0.7383 0.7248 0.7744
    R-PM-GSLHT 0.7920 0.7867 0.8346 0.7817 0.7823 0.8351
    PM-GSLHT 0.7667 0.7333 0.7955 0.7367 0.7443 0.8195
    PM-GSLS* 0.6963 0.7133 0.7679 0.6433 0.6729 0.7351
    PM- GSLavg 0.8537 0.8469 0.8907 0.8117 0.8373 0.8796
    PM-GSL 0.8783 0.8954 0.9337 0.8579 0.8800 0.9164

     | Show Table
    DownLoad: CSV

    (ⅰ) The PM-GSL model outperforms the three ablation studies on diabetes diagnostic task, indicating the necessity of all candidate graphs in generating new graph structures.

    (ⅱ) Compared with the other two ablation studies, PM-GSLS* significantly decreased in the three evaluation indicators, because the semantic information between different relations is more important in the patient multi-relational graph.

    (ⅲ) To verify the effectiveness of the candidate graph weights learned by the PM-GSL, we replace the channel attention layer in Eq (4.11) with an average aggregation layer that treats each type of candidate graph equally. The effect of the channel attention layer is better than that of the PM- GSLavg. If on average, three core candidate graphs are fused in the PM-GSL model, the influence of higher-order semantic information in the patient multi-relational graph will be weakened, and the prediction performance will be reduced.

    We analyze the distribution of the channel attention weight, further studied the ability of the PM-GSL model to distinguish the importance of three core candidate graphs and fuse the three core candidate graphs to generate a new patient multi-relational subgraph. We trained the PM-GSL model 50 times and set all thresholds to 0.2. The distribution of channel attention values is shown in Figure 5.

    Figure 5.  Performance evaluation of diabetes diagnosis.

    Specifically, the original subgraph is an important structure for diabetes prediction. However, for P-T, the PM-GSL model assigns a large channel attention value to the learned higher-order semantic graph, indicating that the information in the higher-order semantic graph is more important. The above phenomenon is consistent with the conclusion (ⅲ) from the ablation experiment, which further proves that the PM-GSL model can adaptively provide more channel attention to important information.

    The similarity thresholds defined in Eqs (4.3), (4.5) and (4.9) can be used to control the sparsity of generated graphs. To better compare the experimental results, we set up λFS = λFH and conduct experiments on different λFS and λMS. Figure 6 shows the Macro-F1 and AUC values in the comparison experiment. On the two datasets, when both λFS = λFH and λMS are set to 1, the prediction performance of the PM-GSL model is significantly reduced. This is because the PM-GSL model uses only the original graph structure of the patient multi-relational graph, which is similar to the general GNN model. This finding also proves the effectiveness of graph structure learning in improving the quality of the patient's multi-relational graph.

    Figure 6.  Parameter sensitivity.

    In this study, we attempt to incorporate heterogeneous graph structure learning into the task of diabetes clinical assistant diagnosis and propose a model, PM-GSL, which jointly learns patient multi-relational graph structures and GNN parameters for diabetes prediction. In particular, we use the effective information in EHRs to build a patient multi-relational graph to simulate the complex interactions between multiple medical entities in EHRs. The patient multi-relational graphs propagate the node characteristics through the topology to generate the underlying graph structure and describe the nodes from micro and macro perspectives, respectively. We designed three fine candidate graphs and fused them to generate clean and effective heterogeneous graphs to solve the weak structure problem of the EHRs.

    Numerous experiments have proven the effectiveness of the PM-GSL model, which well proves that graph structure learning is helpful to improve the quality of patient multi-relational graph and the accuracy of disease prediction tasks. Our proposed PM-GSL model achieved a 9.62% improvement in Macro-F1, 9.17% improvement in Micro-F1 and 10.33% in AUC compared to the baselines. In the future, we aim to extend PM-GSL to the multi-view model and explicitly integrate the label information into graph structure learning to improve the performance of disease clinical assistant diagnosis and enhance the clinical interpretability of the model.

    We would like to thank the anonymous reviewers for their valuable comments. This research was funded by the National Natural Science Foundation of China (No. 61863032), Northwest Normal University Major Research Project Incubation Program, China (No. NWNU-LKZD2021-06).

    The authors declare there is no conflict of interest.



    [1] T. Hirota, R. So, Y. S. Kim, B. Leventhal, R. A. Epstein, A systematic review of screening tools in non-young children and adults for autism spectrum disorder, Res. Dev. Disabil., 80 (2018), 1–12. https://doi.org/10.1016/j.ridd.2018.05.017 doi: 10.1016/j.ridd.2018.05.017
    [2] C. Lord, S. Risi, P.S. Dilavore, C. Shulman, A. Thurm, A. Pickles. Autism from 2 to 9 years of age. Arch. Gen. Psychiatry, 63 (2006), 694–701. https://doi.org/10.1001/archpsyc.63.6.694 doi: 10.1001/archpsyc.63.6.694
    [3] B. B. Sizoo, E. H. Horwitz, J. P. Teunisse, C. C. Kan, C. Vissers, E. Forceville, et al., Predictive validity of self-report questionnaires in the assessment of autism spectrum disorders in adults, Autism, 19 (2015), 842–849. https://doi.org/10.1177/1362361315589869 doi: 10.1177/1362361315589869
    [4] P. O. Towle, P. A. Patrick, Autism Spectrum Disorder Screening Instruments for Very Young Children: A Systematic Review, Autism. Res. Treat., 2016 (2016), 4624829. https://doi.org/10.1155/2016/4624829 doi: 10.1155/2016/4624829
    [5] D. Bone, S. Bishop, M. P. Black, M. S. Goodwin, C. Lord, S. S. Narayanan, Use of machine learning to improve autism screening and diagnostic instruments: effectiveness, efficiency, and multi-Instrument Fusion, J. Child. Psychol. Psychiatry, 57 (2017), 927–937. https://doi.org/10.1111/jcpp.12559 doi: 10.1111/jcpp.12559
    [6] J. A. Kosmicki, V. Sochat, M. Duda, D. P. Wall, Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning, Transl. Psychiatry, 5 (2015), 514–517. https://doi.org/10.1038/tp.2015.7 doi: 10.1038/tp.2015.7
    [7] F. Thabtah, Machine learning in autistic spectrum disorder behavioral research: A review and ways forward, Informatics Heal. Soc. Care, 44 (2019), 278–297. https://doi.org/10.1080/17538157.2017.1399132 doi: 10.1080/17538157.2017.1399132
    [8] D. H. Oh, I. B. Kim, S. H. Kim, D. H. Ahn, Predicting autism spectrum disorder using blood-based gene expression signatures and machine learning, Clin. Psychopharmacol. Neurosci., 15 (2017), 47–52. https://doi.org/10.9758/cpn.2017.15.1.47 doi: 10.9758/cpn.2017.15.1.47
    [9] M. Duda, R. Ma, N. Haber, D. P. Wall, Use of machine learning for behavioral distinction of autism and ADHD, Transl. Psychiatry, 6 (2016), 732. https://doi.org/10.1038/tp.2015.221 doi: 10.1038/tp.2015.221
    [10] G. Li, O. Lee, H. Rabitz, High-efficiency classification of children with autism spectrum disorder, PLoS One, 13 (2018), 1–23. https://doi.org/10.1371/journal.pone.0192867 doi: 10.1371/journal.pone.0192867
    [11] Q. Tariq, S. L. Fleming, J. N. Schwartz, K. Dunlap, C. Corbin, P. Washington, et al., Detecting Developmental Delay and Autism Through Machine Learning Models Using Home Videos of Bangladeshi Children: Development and Validation Study, J. Med. Internet Res., 21 (2019), 13822. https://doi.org/10.2196/13822 doi: 10.2196/13822
    [12] D. Eman, W. R. Emanuel, Machine Learning Classifiers for Autism Spectrum Disorder: A Review, 2019 4th Int. Conf. Inform. Technol. Inform. Syst. Electr. Eng. (ICITISEE), Yogyakarta, Indonesia, 2019. https://doi.org/10.1109/ICITISEE48480.2019.9003807
    [13] X. Bi, Y. Wang, Q. Shu, Q. Sun, Q. Xu, Classification of autism spectrum disorder using random support vector machine cluster, Frontiers in Genetics, 6 (2018), 9–18. https://doi.org/10.3389/fgene.2018.00018 doi: 10.3389/fgene.2018.00018
    [14] E. Grossi, C. Olivieri, M. Buscema, Diagnosis of autism through EEG processed by advanced computational algorithms: a pilot study, Comput. Methods Programs Biomed., 142 (2017), 73–79. https://doi.org/10.1016/j.cmpb.2017.02.002 doi: 10.1016/j.cmpb.2017.02.002
    [15] M. L. Raja, M. Priya, Neural network based classification of EEG signals for diagnosis of autism spectrum disorder, Int. J. Pharm. Bio. Sci., 8 (2017), 1020–1026.
    [16] L. Raja, M. M. Priyab, EEG based ASD diagnosis for children using auto-regressive features and FFNN, Int. J. Control Theo. App., 10 (2017), 27–32.
    [17] L. Raja, M. M. Priya, EEG based diagnosis of autism spectrum disorder using static and dynamic neural networks, ARPN J. Eng. Appl. Sci., 12 (2017), 4653787.
    [18] R. Djemal, K. AlSharabi, S. Ibrahim, A. Alsuwailem, EEG-based computer aided diagnosis of autism spectrum disorder using wavelet, entropy, and ANN, BioMed. Res. Int., 2017 (2017), 1–9. https://doi.org/10.1155/2017/9816591 doi: 10.1155/2017/9816591
    [19] T. M. Heunis, C. Aldrich, P. J. Vries, Recent Advances in Resting-State Electroencephalography Biomarkers for Autism Spectrum Disorder-A Review of Methodological and Clinical Challenges, Rev. Pediatr. Neurol., 61 (2016), 28–37. https://doi.org/10.1016/j.pediatrneurol.2016.03.010 doi: 10.1016/j.pediatrneurol.2016.03.010
    [20] N. P. Jordanova, J. P. Jordanov, Spectrum-weighted EEG frequency ("brain-rate") as a quantitative indicator of mental arousal. Prilozi, 26 (2005), 35–42.
    [21] E. Abdulhay, M. Alafeef, A. Abdelhay, A. Al-Bashir, Classification of Normal, Ictal and Inter-ictal EEG via Direct Quadrature and Random Forest Tree, J. Med. Biol. Eng., 37 (2017), 843–857. https://doi.org/10.1007/s40846-017-0239-z doi: 10.1007/s40846-017-0239-z
    [22] Z. Dandan, D. Haiyan, H. Xinlin, L. Yunfeng, Z. Congle, Y. Datian, The Combination of Amplitude and Sample Entropy in EEG and its Application to Assessment of Cerebral Injuries in Piglets, 2008 Int. Conf. BioMed. Eng. Informatics, Sanya, China, 2008. https://doi.org/10.1109/BMEI.2008.12
    [23] E. Abdulhay, M. Alafeef, L. Alzghoul, M. Al Momani, R. Al Abdi, N. Arunkumar, et al., Computer-aided autism diagnosis via second-order difference plot area applied to EEG empirical mode decomposition, Neural Comput. Appl., 32 (2020), 10947–10956. https://doi.org/10.1007/s00521-018-3738-0 doi: 10.1007/s00521-018-3738-0
    [24] R. J. Oweis, E. W. Abdulhay, Seizure classification in EEG signals utilizing Hilbert-Huang transform, Biomed. Eng. Online, 10 (2011), 38. https://doi.org/10.1186/1475-925X-10-38 doi: 10.1186/1475-925X-10-38
    [25] E. Abdulhay, M. Alafeef, H. Hadoush, N. Alomari, M. Bashayreh, Frequency 3D Mapping and Inter-Channel Stability of EEG Intrinsic Function Pulsation: Indicators Towards Autism Spectrum Diagnosis, 2017 10th Jordanian Int. Electric. Electron. Eng. Conf. (JIEEEC), Amman, Jordan, 2017. https://doi.org/10.1109/JIEEEC.2017.8051416
    [26] H. Hadoush, M. Alafeef, E. Abdulhay, Automated identification for autism severity level: EEG analysis using empirical mode decomposition and second order difference plot, Behavioural Brain Res., 362 (2019), 240–248. https://doi.org/10.1016/j.bbr.2019.01.018 doi: 10.1016/j.bbr.2019.01.018
    [27] E. Abdulhay, V. Elamaran, M. Chandrasekar, V. S. Balaji, and K. Narasimhan, Automated diagnosis of epilepsy from EEG signals using ensemble learning approach, Pattern Recognition Letters, 139 (2020), 174–181. https://doi.org/10.1016/j.patrec.2017.05.021 doi: 10.1016/j.patrec.2017.05.021
    [28] T. H. Pham, J. Vicnesh, J. K. Wei, S. J. Oh, N. Arunkumar, E. Abdulhay, et al., Autism spectrum disorder diagnostic system using HOS bispectrum with EEG signals, Int. J. Environ. Res. Public Health, 17 (2020), 1–14. https://doi.org/10.3390/ijerph17030971 doi: 10.3390/ijerph17030971
    [29] W. Bosl, A. Tierney, H. T. Flusberg, C. Nelson, EEG complexity as a biomarker for autism spectrum disorder risk, BMC Med., 9 (2011), 18. https://doi.org/10.1186/1741-7015-9-18 doi: 10.1186/1741-7015-9-18
    [30] F. H. Duffy, A. Heidelise, Autism, spectrum or clusters? An EEG coherence study, BMC Neurol., 19 (2019), 27. https://doi.org/10.1186/s12883-019-1254-1 doi: 10.1186/s12883-019-1254-1
    [31] A. Sheikhani, H. Behnam, M. R. Mohammadi, M. Noroozian, Analysis of EEG background activity in Autsim disease patients with bispectrum and STFT measure, Proceedings of the 11th WSEAS Int. Conf. Commun., Agios Nikolaos, Greece, 2007.
    [32] J. Kang, H. Chen, X. Li, X. Li, EEG entropy analysis in autistic children, J. Clin. Neurosci., 62 (2019), 199–206. https://doi.org/10.1016/j.jocn.2018.11.027 doi: 10.1016/j.jocn.2018.11.027
    [33] L. Billeci, F. Sicca, K. Maharatna, F. Apicella, A. Narzisi, G. Campatelli, et al., On the application of quantitative EEG for characterizing autistic brain: a systematic review, Front. Hum. Neurosci., 7 (2013), 442. https://doi.org/10.3389/fnhum.2013.00442 doi: 10.3389/fnhum.2013.00442
    [34] M. Ahmadlou, H. Adeli, A. Adeli, Fractality and a wavelet-chaos-neural network methodology for EEG-based diagnosis of autistic spectrum disorder, J. Clin. Neurophysiol., 27 (2010), 328–333. https://doi.org/10.1097/WNP.0b013e3181f40dc8 doi: 10.1097/WNP.0b013e3181f40dc8
    [35] B. B. Mandelbrot, The Fractal Geometry of Nature. New York: Freeman and Company (1977), 1–468.
    [36] M. Costa, A. L. Goldberger, C. K. Peng, Multiscale entropy analysis of biological signals. Phys. Rev. E., 71 (2005), 021906. https://doi.org/10.1103/PhysRevE.71.021906 doi: 10.1103/PhysRevE.71.021906
    [37] A. Namdari, Z. Li, A review of entropy measures for uncertainty quantification of stochastic processes, Adv. Mechanical Eng., 11 (2019), 1–14. https://doi.org/10.1177/1687814019857350 doi: 10.1177/1687814019857350
    [38] H. Hadoush, M. Alafeef, E. Abdulhay, Brain complexity in children with mild and severe autism spectrum disorders: analysis of multiscale entropy in EEG, Brain Topography, 32 (2019), 914–921. https://doi.org/10.1007/s10548-019-00711-1 doi: 10.1007/s10548-019-00711-1
    [39] Y. Ghanbari, L. Bloy, J. C. Edgar, L. Blaskey, R. Verma, T. P. Roberts, Joint analysis of band-specific functional connectivity and signal complexity in autism, J. Autism Dev. Disord., 45 (2015), 444–460. https://doi.org/10.1007/s10803-013-1915-7 doi: 10.1007/s10803-013-1915-7
    [40] T. Liu, Y. Chen, D. Chen, C. Li, Y. Qiu, J. Wang, Altered electroencephalogram complexity in autistic children shown by the multiscale entropy approach, Neuro. Report, 28 (2017), 169–173. https://doi.org/10.1097/WNR.0000000000000724 doi: 10.1097/WNR.0000000000000724
    [41] J. O. Maximo, D. L. Murdaugh, R. K. Kana, Alterations in Brain Entropy in Autism Spectrum Disorders, 2017 Int. Meet. Autism Res., Birmingham, USA, 2017.
    [42] J. Q. Kosciessa, N. A. Kloosterman, D. D. Garrett, Standard multiscale entropy reflects neural dynamics at mismatched temporal scales: What's signal irregularity got to do with it?, PLOS Comput. Biol., 16 (2020), e1007885. https://doi.org/10.1371/journal.pcbi.1007885 doi: 10.1371/journal.pcbi.1007885
    [43] A. Catarino, O. Churches, S. B. Cohen, A. Andrade, H. Ring, Atypical EEG complexity in autism spectrum conditions: a multiscale, entropy analysis, Clin. Neurophysiol., 122 (2011), 2375–2383. https://doi.org/10.1016/j.clinph.2011.05.004 doi: 10.1016/j.clinph.2011.05.004
    [44] J. S. Richman, J. R. Moorman, Physiological time-series analysis using approximate entropy and sample entropy, Am. J. Physiol. Heart Circ. Physiol., 278 (2000), H2039–49. https://doi.org/10.1152/ajpheart.2000.278.6.H2039 doi: 10.1152/ajpheart.2000.278.6.H2039
    [45] R. Ferenets, T. Lipping, A. Anier, V. Jantti, S. Melto, S. Hovilehto, Comparison of entropy and complexity measures for the assessment of depth of sedation, IEEE Trans. Biomed. Eng., 53 (2006), 1067–1077. https://doi.org/10.1109/TBME.2006.873543 doi: 10.1109/TBME.2006.873543
    [46] A. H. Heurtier, The Multiscale Entropy Algorithm and Its Variants: A Review, Entropy, 17 (2015), 3110–3123. https://doi.org/10.3390/e17053110 doi: 10.3390/e17053110
    [47] H. Azami and J. Escudero, Amplitude- and Fluctuation-Based Dispersion Entropy, Entropy, 20 (2018), 210. https://doi.org/10.3390/e20030210 doi: 10.3390/e20030210
    [48] J. F. Valencia, A. Porta, M. Vallverdu, F. Claria, R. Baranowski, E. O. Baranowska, et al., Refined multiscale entropy: Application to 24-h Holter recordings of heart period variability in healthy and aortic stenosis subjects, IEEE Trans. Biomed., 56 (2009), 2202–2213. https://doi.org/10.1109/TBME.2009.2021986 doi: 10.1109/TBME.2009.2021986
    [49] J. F. Valencia, M. Vallverdu, R. Schroeder, L. Cygankiewicz, R. Vazquez, A. B. Luna, et al., Heart rate variability characterized by refined multiscale entropy applied to cardiac death in ischemic cardiomyopathy patients, Comput. Cardiol., 37 (2010), 65–68.
    [50] W. J. Bosl, T. Loddenkemper, C. A. Nelson, Nonlinear EEG biomarker profiles for autism and absence epilepsy, Neuropsychiatric Electrophysiology, 3 (2017), 1. https://doi.org/10.1186/s40810-017-0023-x doi: 10.1186/s40810-017-0023-x
    [51] W. J. Bosl, H. T. Flusberg, C. A. Nelson, EEG Analytics for Early Detection of Autism Spectrum Disorder: A data-driven approach, Sci. Rep., 8 (2018), 6828. https://doi.org/10.1038/s41598-018-24318-x doi: 10.1038/s41598-018-24318-x
    [52] S. D. Wu, C.W. Wu, K.Y. Lee, S. G. Lin, Modified multiscale entropy for short-term time series analysis, Physica A, 392 (2013), 15865–5873. https://doi.org/10.1016/j.physa.2013.07.075 doi: 10.1016/j.physa.2013.07.075
    [53] S. D. Wu, C. W. Wu, S. G. Lin, C. C. Wang, K. Y. Lee, Time series analysis using composite multiscale entropy, Entropy, 15 (2013), 1069–1084. https://doi.org/10.3390/e15031069 doi: 10.3390/e15031069
    [54] S. D. Wu, C. W. Wu, S. G. Lin, K. Y. Lee, C. K. Peng, Analysis of complex time series using refined composite multiscale entropy, Phys. Lett. A, 378 (2014), 1369–1374. https://doi.org/10.1016/j.physleta.2014.03.034 doi: 10.1016/j.physleta.2014.03.034
    [55] S. D. Wu, C. W. Wu, K. Y. Lee, S. G. Lin, Modified multiscale entropy for short-term time series analysis, Phys. A, 392 (2013), 5865–5873. https://doi.org/10.1016/j.physa.2013.07.075 doi: 10.1016/j.physa.2013.07.075
    [56] Y. C. Chang, H. T. Wu, H. R. Chen, A. B. Liu, J. J. Yeh, M. T. Lo, et al., Application of a modified entropy computational method in assessing the complexity of pulse wave velocity signals in healthy and diabetic subjects, Entropy, 16 (2014), 4032–4043. https://doi.org/10.3390/e16074032 doi: 10.3390/e16074032
    [57] Y. Jiang, C. K. Peng, Y. Xu, Hierarchical entropy analysis for biological signals, J. Comput. Appl. Math., 236 (2011), 728–742. https://doi.org/10.1016/j.cam.2011.06.007 doi: 10.1016/j.cam.2011.06.007
    [58] H. B. Xie, W. X. He, H. Liu, Measuring time series regularity using nonlinear similarity-based sample entropy, Phys. Lett. A, 372 (2008), 7140–7146. https://doi.org/10.1016/j.physleta.2008.10.049 doi: 10.1016/j.physleta.2008.10.049
    [59] M. U. Ahmed, D. P. Mandic, Multivariate multiscale entropy analysis, IEEE Signal Process. Lett., 19 (2012), 91–94. https://doi.org/10.1109/LSP.2011.2180713 doi: 10.1109/LSP.2011.2180713
    [60] M. D. Costa, A. L. Goldberger, Generalized multiscale entropy analysis: Application to quantifying the complex volatility of human heartbeat time series, Entropy, 17 (2015), 1197–1203. https://doi.org/10.3390/e17031197 doi: 10.3390/e17031197
    [61] L. Faes, A. Porta, M. Javorka, G. Nollo, Efficient Computation of Multiscale Entropy over Short Biomedical Time Series Based on Linear State-Space Models, Complexity, 2017 (2017), 1768264. https://doi.org/10.1155/2017/1768264 doi: 10.1155/2017/1768264
    [62] T. Takahashi, Complexity of spontaneous brain activity in mental disorders, Prog. Neuropsychopharmacol. Biol. Psychiatry, 45 (2013), 258–266. https://doi.org/10.1016/j.pnpbp.2012.05.001 doi: 10.1016/j.pnpbp.2012.05.001
    [63] N. Huang, Z. Shen, S. Long, M. Wu, H. H. Shih, Q. Zheng, et al., The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proc. Math. Phys. Eng. Sci., 454 (1998), 903–995. https://doi.org/10.1098/rspa.1998.0193 doi: 10.1098/rspa.1998.0193
    [64] N. E. Huang, Z. Wu, A review on Hilbert-Huang transform: Method and its applications to geophysical studies, Rev. Geophys., 46 (2008), 228–251. https://doi.org/10.1029/2007RG000228 doi: 10.1029/2007RG000228
    [65] F. R. Kschischang, The Hilbert Transform. Toronto: University of Toronto, 2006.
    [66] E. Abdulhay, P.Y. Guméry, J. Fontecave, P. Baconnier, Cardiogenic oscillations extraction in inductive plethysmography: Ensemble empirical mode decomposition, Annu. Int. Conf. IEEE Eng. Med. Biol. Soc., Minnesota, USA, 2009, 2240–2243. https://doi.org/10.1109/IEMBS.2009.5335004
    [67] X. Han, J. Peng, A. Cui, F. Zhao, Sparse Principal Component Analysis via Fractional Function Regularity, Math. Probl. Eng., 2020 (2020), 7874140. https://doi.org/10.1155/2020/7874140 doi: 10.1155/2020/7874140
    [68] C. K. Arthur, V. A. Temeng, Y. Y. Ziggah, Performance Evaluation of Training Algorithms in Backpropagation Neural Network Approach to Blast-Induced Ground Vibration Prediction, Ghana Mining J., 20 (2020), 20–33. https://doi.org/10.4314/gm.v20i1.3 doi: 10.4314/gm.v20i1.3
    [69] K. Kovarski, J. Malvy, R. K. Khanna, S. Arsène, M. Batty, M. Latinus, Reduced visual evoked potential amplitude in autism spectrum disorder, a variability effect?, Translational Psychiatry, 9 (2019), 341. https://doi.org/10.1038/s41398-019-0672-6 doi: 10.1038/s41398-019-0672-6
    [70] S. A. Nastase, V. Iacovella, B. Davis, U. Hasson, Connectivity in the human brain dissociates entropy and complexity of auditory inputs, NeuroImage, 31 (2015), 292–300. https://doi.org/10.1016/j.neuroimage.2014.12.048 doi: 10.1016/j.neuroimage.2014.12.048
    [71] P. Barttfeld, B. Wicker, S. Cukier, S. Navarta, S. Lew, M. Sigman, A big-world network in ASD: dynamical connectivity analysis reflects a deficit in longrange connections and an excess of short-range connections, Neuropsychologia, 49 (2015), 254–263. https://doi.org/10.1016/j.neuropsychologia.2010.11.024 doi: 10.1016/j.neuropsychologia.2010.11.024
    [72] H. Zhang, R. Li, X. Wen, Q. Li, X. Wu, Altered Time-Frequency Feature in Default Mode Network of Autism Based on Improved Hilbert-Huang Transform, IEEE J. Biomed. Health Informatics, 25 (2021), 485–492. https://doi.org/10.1109/JBHI.2020.2993109 doi: 10.1109/JBHI.2020.2993109
    [73] T. Wadhera, D. Kakkar, Conditional entropy approach to analyze cognitive dynamics in autism spectrum disorder, Neurol. Res., 42 (2020), 869–878. https://doi.org/10.1080/01616412.2020.1788844 doi: 10.1080/01616412.2020.1788844
    [74] E. Gani, N. Handayani, S. H. Pratama, N. Afif, F. Aziezah, A. C. Keintjem, et al., Brainwaves Analysis Using Spectral Entropy in Children with Autism Spectrum Disorders (ASD), J. phys. Conf. ser., 1505 (2020), 012070. https://doi.org/10.1088/1742-6596/1505/1/012070 doi: 10.1088/1742-6596/1505/1/012070
    [75] E. Amiot, Entropy of Fourier coefficients of periodic musical objects, J. Math. Music, 15 (2021), 235–246. https://doi.org/10.1080/17459737.2020.1777592 doi: 10.1080/17459737.2020.1777592
    [76] D. Abásolo, R. Hornero, P. Espino, D. Alvarez, J. Poza, Entropy analysis of the EEG background activity in Alzheimer's disease patients, Physiol. Meas., 27 (2006), 241–253. https://doi.org/10.1088/0967-3334/27/3/003 doi: 10.1088/0967-3334/27/3/003
    [77] J. Han, Y. Li, J. Kang, E. Cai, Z. Tong, G. Ouyang, et al., Global Synchronization of Multichannel EEG Based on Rényi Entropy in Children with Autism Spectrum Disorder, Appl. Sci., 7 (2017), 257. https://doi.org/10.3390/app7030257 doi: 10.3390/app7030257
    [78] E. Abdulhay, M. Alafeef, H. Hadoush, N. Arunkumar, Resting State EEG-based Diagnosis of Autism via Elliptic Area of Continuous Wavelet Transform Complex Plot, J. Intell. fuzzy syst., 39 (2020), 8599–8607. https://doi.org/10.3233/JIFS-189176 doi: 10.3233/JIFS-189176
    [79] R. Okazaki, T. Takahashi, K. Ueno, K. Takahashi, M. Ishitobi, M. Kikuchi, et al., Changes in EEG complexity with electroconvulsive therapy in a patient with autism spectrum disorders: a multiscale entropy approach, Front. Hum. Neurosci., 9 (2015), 25767444. https://doi.org/10.3389/fnhum.2015.00106 doi: 10.3389/fnhum.2015.00106
    [80] S. Thapaliya, S. Jayarathna, M. Jaime, Evaluating the EEG and eye movements for autism spectrum disorder, 2018 IEEE Int. Conf. Big Data, Seattle, WA, USA, 2018. https://doi.org/10.1109/BigData.2018.8622501
    [81] J. Eldridge, A. E. Lane, M. Belkin, S. Dennis, Robust features for the automatic identification of autism spectrum disorder in children, J. Neurodev. Disord., 6 (2014), 1–12. https://doi.org/10.1186/1866-1955-6-12 doi: 10.1186/1866-1955-6-1
    [82] H. Amoud, H. Snoussi, D. Hewson, M. Doussot, J. Duchêne, Intrinsic mode entropy for nonlinear discriminant analysis, IEEE Signal Process. Lett., 14 (2007), 297–300. https://doi.org/10.1109/LSP.2006.888089 doi: 10.1109/LSP.2006.888089
    [83] M. Hu, H. Liang, Adaptive multiscale entropy analysis of multivariate neural data, IEEE Trans. Biomed. Eng., 59 (2012), 12–15. https://doi.org/10.1109/TBME.2011.2162511 doi: 10.1109/TBME.2011.2162511
    [84] O. Dekhil, M. Ali, Y. E. Nakeib, A. Shalaby, A. Soliman, A. Switala, et.al., A Personalized Autism Diagnosis CAD System Using a Fusion of Structural MRI and Resting-State Functional MRI Data. Front. Psychiatry, 10 (2021), 1–16. https://doi.org/10.3389/fpsyt.2019.00392 doi: 10.3389/fpsyt.2019.00392
    [85] O. Dekhil, M. Ali, R. Haweel, Y. Elnakeib, M. Ghazal, H. Hajjdiab, et.al. A Comprehensive Framework for Differentiating Autism Spectrum Disorder From Neurotypicals by Fusing Structural MRI and Resting State Functional MRI, Seminars in Pediatric Neurology., 34 (2020), 100805. https://doi.org/10.1016/j.spen.2020.100805 doi: 10.1016/j.spen.2020.100805
    [86] K. Barik, K. Watanabe, J. Bhattacharya, G. Saha, Classification of Autism in Young Children by Phase Angle Clustering in Magnetoencephalogram Signals, 2020 National Conf. Commun. (NCC), Kharagpur, India, 2020, 1–6. https://doi.org/10.1109/NCC48643.2020.9056022
  • This article has been cited by:

    1. Heloísa Oss Boll, Ali Amirahmadi, Mirfarid Musavian Ghazani, Wagner Ourique de Morais, Edison Pignaton de Freitas, Amira Soliman, Farzaneh Etminani, Stefan Byttner, Mariana Recamonde-Mendoza, Graph neural networks for clinical risk prediction based on electronic health records: A survey, 2024, 151, 15320464, 104616, 10.1016/j.jbi.2024.104616
    2. Richard John Woodman, Bogda Koczwara, Arduino Aleksander Mangoni, Applying precision medicine principles to the management of multimorbidity: the utility of comorbidity networks, graph machine learning, and knowledge graphs, 2024, 10, 2296-858X, 10.3389/fmed.2023.1302844
    3. Shirui Yu, Aihua Li, Yifei Chen, Dechao Wang, Xiaoli Tang, Heterogeneous network-based algorithms in the biomedical data mining: A review from technical perspective, 2024, 1, 29499534, 111, 10.1016/j.infoh.2024.07.002
    4. Richard J. Woodman, Arduino A. Mangoni, A comprehensive review of machine learning algorithms and their application in geriatric medicine: present and future, 2023, 35, 1720-8319, 2363, 10.1007/s40520-023-02552-2
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(3461) PDF downloads(165) Cited by(3)

Figures and Tables

Figures(7)  /  Tables(5)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog