Loading [MathJax]/jax/output/SVG/jax.js
Research article Special Issues

EMD-based analysis of complexity with dissociated EEG amplitude and frequency information: a data-driven robust tool -for Autism diagnosis- compared to multi-scale entropy approach

  • Objective: Autism spectrum disorder (ASD) is usually characterised by altered social skills, repetitive behaviours, and difficulties in verbal/nonverbal communication. It has been reported that electroencephalograms (EEGs) in ASD are characterised by atypical complexity. The most commonly applied method in studies of ASD EEG complexity is multiscale entropy (MSE), where the sample entropy is evaluated across several scales. However, the accuracy of MSE-based classifications between ASD and neurotypical EEG activities is poor owing to several shortcomings in scale extraction and length, the overlap between amplitude and frequency information, and sensitivity to frequency. The present study proposes a novel, nonlinear, non-stationary, adaptive, data-driven, and accurate method for the classification of ASD and neurotypical groups based on EEG complexity and entropy without the shortcomings of MSE. Approach: The proposed method is as follows: (a) each ASD and neurotypical EEG (122 subjects × 64 channels) is decomposed using empirical mode decomposition (EMD) to obtain the intrinsic components (intrinsic mode functions). (b) The extracted components are normalised through the direct quadrature procedure. (c) The Hilbert transforms of the components are computed. (d) The analytic counterparts of components (and normalised components) are found. (e) The instantaneous frequency function of each analytic normalised component is calculated. (f) The instantaneous amplitude function of each analytic component is calculated. (g) The Shannon entropy values of the instantaneous frequency and amplitude vectors are computed. (h) The entropy values are classified using a neural network (NN). (i) The achieved accuracy is compared to that obtained with MSE-based classification. (j) The consistency of the results of entropy 3D mapping with clinical data is assessed. Main results: The results demonstrate that the proposed method outperforms MSE (accuracy: 66.4%), with an accuracy of 93.5%. Moreover, the entropy 3D mapping results are more consistent with the available clinical data regarding brain topography in ASD. Significance: This study presents a more robust alternative to MSE, which can be used for accurate classification of ASD/neurotypical as well as for the examination of EEG entropy across brain zones in ASD.

    Citation: Enas Abdulhay, Maha Alafeef, Hikmat Hadoush, V. Venkataraman, N. Arunkumar. EMD-based analysis of complexity with dissociated EEG amplitude and frequency information: a data-driven robust tool -for Autism diagnosis- compared to multi-scale entropy approach[J]. Mathematical Biosciences and Engineering, 2022, 19(5): 5031-5054. doi: 10.3934/mbe.2022235

    Related Papers:

    [1] Jihong Yang, Hao Xu, Congshu Li, Zhenhao Li, Zhe Hu . An explorative study for leveraging transcriptomic data of embryonic stem cells in mining cancer stemness genes, regulators, and networks. Mathematical Biosciences and Engineering, 2022, 19(12): 13949-13966. doi: 10.3934/mbe.2022650
    [2] Ahmed Hammad, Mohamed Elshaer, Xiuwen Tang . Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning. Mathematical Biosciences and Engineering, 2021, 18(6): 8997-9015. doi: 10.3934/mbe.2021443
    [3] Huiqing Wang, Xiao Han, Jianxue Ren, Hao Cheng, Haolin Li, Ying Li, Xue Li . A prognostic prediction model for ovarian cancer using a cross-modal view correlation discovery network. Mathematical Biosciences and Engineering, 2024, 21(1): 736-764. doi: 10.3934/mbe.2024031
    [4] Lingling Li, Yulu Hu, Xin Li, Tianhai Tian . Mathematical modeling the gene mechanism of colorectal cancer and the effect of radiation exposure. Mathematical Biosciences and Engineering, 2024, 21(1): 1186-1202. doi: 10.3934/mbe.2024050
    [5] Siyuan Tian, Yinan Hu, Chunmei Yang, Jiahao Yu, Jingyi Liu, Guoyun Xuan, Yansheng Liu, Keshuai Sun, Miao Zhang, Shuoyi Ma, Yulong Shang, Xia Zhou, Ying Han . A novel immune checkpoint-related gene signature for hepatocellular carcinoma to predict clinical outcomes and therapeutic response. Mathematical Biosciences and Engineering, 2022, 19(5): 4719-4736. doi: 10.3934/mbe.2022220
    [6] Jiaxin Luo, Lin Wu, Dinghui Liu, Zhaojun Xiong, Linli Wang, Xiaoxian Qian, Xiaoqiang Sun . Gene regulatory network analysis identifies key genes and regulatory mechanisms involved in acute myocardial infarction using bulk and single cell RNA-seq data. Mathematical Biosciences and Engineering, 2021, 18(6): 7774-7789. doi: 10.3934/mbe.2021386
    [7] Dong-feng Li, Aisikeer Tulahong, Md. Nazim Uddin, Huan Zhao, Hua Zhang . Meta-analysis identifying epithelial-derived transcriptomes predicts poor clinical outcome and immune infiltrations in ovarian cancer. Mathematical Biosciences and Engineering, 2021, 18(5): 6527-6551. doi: 10.3934/mbe.2021324
    [8] Yong Ding, Jian-Hong Liu . The signature lncRNAs associated with the lung adenocarcinoma patients prognosis. Mathematical Biosciences and Engineering, 2020, 17(2): 1593-1603. doi: 10.3934/mbe.2020083
    [9] Jie Wang, Md. Nazim Uddin, Rehana Akter, Yun Wu . Contribution of endothelial cell-derived transcriptomes to the colon cancer based on bioinformatics analysis. Mathematical Biosciences and Engineering, 2021, 18(6): 7280-7300. doi: 10.3934/mbe.2021360
    [10] Bin Ma, Lianqun Cao, Yongmin Li . A novel 10-gene immune-related lncRNA signature model for the prognosis of colorectal cancer. Mathematical Biosciences and Engineering, 2021, 18(6): 9743-9760. doi: 10.3934/mbe.2021477
  • Objective: Autism spectrum disorder (ASD) is usually characterised by altered social skills, repetitive behaviours, and difficulties in verbal/nonverbal communication. It has been reported that electroencephalograms (EEGs) in ASD are characterised by atypical complexity. The most commonly applied method in studies of ASD EEG complexity is multiscale entropy (MSE), where the sample entropy is evaluated across several scales. However, the accuracy of MSE-based classifications between ASD and neurotypical EEG activities is poor owing to several shortcomings in scale extraction and length, the overlap between amplitude and frequency information, and sensitivity to frequency. The present study proposes a novel, nonlinear, non-stationary, adaptive, data-driven, and accurate method for the classification of ASD and neurotypical groups based on EEG complexity and entropy without the shortcomings of MSE. Approach: The proposed method is as follows: (a) each ASD and neurotypical EEG (122 subjects × 64 channels) is decomposed using empirical mode decomposition (EMD) to obtain the intrinsic components (intrinsic mode functions). (b) The extracted components are normalised through the direct quadrature procedure. (c) The Hilbert transforms of the components are computed. (d) The analytic counterparts of components (and normalised components) are found. (e) The instantaneous frequency function of each analytic normalised component is calculated. (f) The instantaneous amplitude function of each analytic component is calculated. (g) The Shannon entropy values of the instantaneous frequency and amplitude vectors are computed. (h) The entropy values are classified using a neural network (NN). (i) The achieved accuracy is compared to that obtained with MSE-based classification. (j) The consistency of the results of entropy 3D mapping with clinical data is assessed. Main results: The results demonstrate that the proposed method outperforms MSE (accuracy: 66.4%), with an accuracy of 93.5%. Moreover, the entropy 3D mapping results are more consistent with the available clinical data regarding brain topography in ASD. Significance: This study presents a more robust alternative to MSE, which can be used for accurate classification of ASD/neurotypical as well as for the examination of EEG entropy across brain zones in ASD.



    Colorectal cancer (CRC), also named as colon cancer, is among top 3 common cancer diseases worldwide. With the fast development of powerful bioinformatics tools like microarray and high-throughput sequencing technique, the researchers are able to have a peek into the gene expression level of such disease. Omics data-based modeling and analysis does not only provide opportunity of systematically uncover the molecular mechanism underlying CRC development but also facilitate designing more effective treatments and prognosis.

    Many studies focus on exploring the gene expression profile to select differentially expressed genes and connect their known gene function with the disease. In such analysis, genes are viewed as independent molecules and their connections are often ignored. As such, the conventional differential gene expression studies fall short on investigating key genes at systems level. Network-based methods have been used to analyze high-throughput transcriptomic data of cancer patients [1]. The prevalent methods of reconstructing gene networks from cross-sectional clinical transcriptomic data mainly include correlational methods, such as PCC-based methods [2], mutual information [3], regression methods [4] or machining-learning methods [5], which can only infer correlations or associations between genes. However, these network reconstruction methods cannot infer directions of the edges in the network which is important for detecting regulatory relationship between genes underlying disease development.

    To reconstruct gene regulatory networks (GRNs) from gene expression data, many methods have also been developed in the past years [6]. The typical methods include Boolean network methods [7], ordinary differential equation (ODE) methods [8,9], Bayesian network methods [10]. These methods usually require time-course gene expression data [11] for inferring directed GRN. However, the temporal information is always not available for the clinical transcriptomic data of patients, which restricted to apply these methods.

    On the other hand, many studies employed static network approach to bridge the connection between molecular profile and biological functions [1]. However, cancer is a dynamic evolving disease and the intrinsic molecular interactions should also have dynamic changing patterns. Therefore, constructing dynamic regulatory networks that are rewiring at different stages is a rational way to understand the mechanism underlying cancer development.

    In this paper, to detect dynamic rewiring of gene regulation network along different disease stage of colorectal cancer, we reconstruct the stage-specific gene regulatory networks from a combined colorectal cancer dataset. We first used a supervised approach to select genes for network construction and analysis by integrating genes’ prognostic significance, functional association and correlation relationship and dynamic trends. An initial gene network was constructed using mutual information, which was then refined by employing a dynamic system model-based optimization approach to reconstruct the dynamic GRNs based on different colorectal cancer stages. The dynamic changes of the GRNs were analyzed to reveal important genes and their regulations during CRC development.

    In this article, four datasets of different sources are collected from both local hospital and gene expression omnibus (GEO) repository. GSE12945 is collected from Germany Max Planck Institute [12]; GSE14333 is collected from Australia Melbourne Royal Hospital [13] and GSE17536 is collected from United States Moffitt Cancer Center [14]. We also used a dataset of a Chinese cohort in our previous study, where 67 colorectal samples are processed through microarray analysis [15]. Both the gene expression data and clinical information of patients were collected.

    After close examination of the datasets, we found that the sample number of different cancer grade is unequally distributed among the four datasets, in which GSE12945 does not include grade A and grade D samples while GSE14333 does not include grade D samples. The best way to solve the inequality of sample number is to combine them together as a largely merged dataset, where the sample number of each grade is more balanced. Using selected variables including GSE IDs, overall survival months, living status, gender, cancer grade, age at diagnosis and data source, we combined the 4 different datasets together, resulting in 529 samples with valid phenotype information. Detailed information can be found in Table 1. We used ‘removeBatchEffect’ in limma R package to normalize the merged data and to reduce the between-experiments batch effects, which is used in the downstream analysis.

    Table 1.  Detailed information about four colorectal cancer datasets and merged dataset.
    Chinese cohort Miffitt (GSE17536) Max Planck (GSE12945) Melbourne (GSE14333) Merge (no NAs)
    Sample (n) 67 177 62 226 529
    Male, n (%) 41 (61) 96 (54) 34 (55) 120 288 (54.4)
    Female, n (%) 26 (39) 81 (46) 28 (45) 106 241 (45.6)
    Age (years)
    Median 59 66 67 66
    Range 19–92 26–92 26–92 19–92
    Cancer Grade
    A 4 24 0 41 61
    B 16 57 31 94 276
    C 29 57 31 91 177
    D 18 39 0 0 15
    Deaths 19 73 12 176 262
    Median Survival Time (days) 1617.5 1268.1 1395 1153.8 1161.6

     | Show Table
    DownLoad: CSV

    Mutual information can be used to measure the nonlinear correlation relationship between two genes, which is a bilateral relation measurement. We use I(A,B) to denote the mutual information between gene A and gene B. If I(A,B)=0, then two gene expression patterns are uncorrelated, while larger I(A,B) stands for closer pattern similarity and potentially implies close biological relationship.

    In order to construct a mutual information network, we first calculate the mutual information value for each pair of genes, then a threshold is set to construct the connection network. In this step, the estimation method of mutual information plays a pivotal role in the performance of the ultimate mutual information network. Currently, two methods are commonly used to estimate the mutual information. First is to increase the accuracy of the estimation of joint probability density function from probability density estimation, such as histogram method and kernel density method [16]. The second way is to avoid the estimation of joint probability density function, such as K-nearest neighbor (KNN) method [17].

    In terms of gene expression data, kernel density estimation is a better way because of its advantage over the high-dimensional dataset and the intrinsic normal distribution pattern of gene expression values. Define X1,X2,,Xn as independent identically distributed random variables with joint probability density function f(x), then the kernel density estimation for function f(x) is

    ˆfh(x)=1nhni=1K(xixh),xR (1)

    where K(·) is called kernel function, and h is a pre-defined positive number, often referred as bandwidth funcion.

    From the calculation of mutual information above, we know that the estimation for mutual information I(X,Y) is ˆI(X,Y), and it reads

    ˆI(X,Y)=X,Yˆf(x,y)logˆf(x,y)ˆf(x)ˆf(y)dxdy. (2)

    Based on the above MI estimation of each pair of genes, we apply data process inequality (DPI) algorithm [18] to eliminate indirect associations between genes. In this way, a sparse network could be constructed, which is more suitable for the following regulatory network estimation.

    Given the following gene interactions: XYZ, where Z is correlated with Y but not with X, then the DPI algorithm considers that

    I(X,Y)I(X,Z) and I(Y,Z)I(X,Z) (3)

    That is, if (X, Y) and (Y, Z) are correlated directly but (X, Z) is indirectedly correlated through Y, then I(X,Z)I(X,Y), and I(X,Z)I(Y,Z). Using the DPI algorithm, we could exclude those indirect correlations, and retain those estimated to be directly associated gene pairs. We use eij to denote the edges in the MI network after network sparsification using DPI algorithm. eij=1 if there is a direct association between gene i and gene j, otherwise eij=0.

    Assume xi(t) to be the expression level of gene i at time t, then the ordinary differetial equation (ODE) of transcriptionl kinetics would be

    dxi(t)dt=jRieij·βij·xj(t)αi·xi(t) (4)

    where αi is the mRNA turnover rate, Ri is the set of regulators of gene i and βij is the regularotry strength from gene j to gene i. We incorporate eij derived from the MI network into the above ODE model to exclude indirectly-correlated gene pairs from the network model for the purpose of reducing computational cost in the following parameter estimation steps. If the time-course data are provided, this equation can be utilized to construct a dynamic gene regulation networks (GRN), and the coefficients can be estimated by a linear regression method. The coefficients can also be used to demonstrate the gene interaction in a GRN.

    Many available cancer datasets for research use is stage-static data, in which all the case with the same cancer progression grade will be counted without time-course data. Thus, it makes people hard to examine the dynamic gene evolving process. In order to overcome such obstacle, we employed a dynamic cascaded methods (DCM) algorithm [19] (Text S1) to reconstruct the dynamic gene networks from sample-based transcriptional data.

    According to the intra-stage steady-rate assumption and the continuity assumption [19], we are able to build up a model that connects two consecutive stages in terms of gene profile.

    ˉx(s)i=a(s1,s)i·ˉx(s1)i+jRi(s1)(bij(s1,s)·ˉx(s1)j)+jRi(bij(s)·ˉx(s)j) (5)

    where ˉx(s)i and ˉx(s1)i stand for the mean expressions of gene i in stage s and s-1. In addition, the coefficients in this formula is closely related to the previous ODE formula:

    a(s1,s)i=2αiL(s1)2+αiL(s),bij(s1, s)=L(s1)2+αiL(s)·eij·β(s1)ij, bijs=L(s)2+αiL(s)·eij·β(s)ij (6)

    where L(s) is the time length of stage s, αi is the degradation rate, and β(s)ij is the regulatory coefficient.

    In order to describe the dynamic pattern within a stage, we should define the λ fraction of a stage as the proportional interpolation between the earliest and latest time points of the stage. According to the previous two assumptions, gene expression should be of linear trend change continuously, hence the largest and smallest gene expression should be on the either end of a gene expression stage. t(s)1 and t(s)N(s) denotes the earliest and the latest time points of stage s. The time of the λ fraction in stage s can be expressed as t(s)λ=   t(s)1+λ·(t(s)N(s)t(s)1). After certain transformations, the dynamic model equation can be written as:

    x(s)i(t(s)λ)=a(s1,s)i·x(s1)i(t(s1)λ)+jRi(s1)(bij(s1, s)·x(s1)j(t(s1)λ))                 +jRi(s)(bij(s)·x(s)j(t(s)λ)) (7)

    In this equation, gene expressions at the same (λ) fraction of two consecutive stages are connected. It describes the inter-stage dynamics of the GRN. And the intra-stage dynamical GRN can be described by the former ODE equation.

    The details of the DCM algorithm are described in Text S1. All the steps listed above are performed in R (version 3.3.3) using packages including parallel, annotate, hgu133a.db, igraph, and most importantly, glmnet package that allows to fast implementation of elasticnet regression in selecting appropriate penalty coefficient and equation coefficients.

    In the merged dataset, there are about an equal number of male and female samples, counting for 54.4 and 45.6% respectively. The median age in those patients is 66 years old, corresponding to the higher hazard rate for old people to suffer from colorectal cancer. There are also young patients, who are only 19 years old, while the oldest patients are in their 90s. When it comes to the number of samples for each cancer grade, Grade B has the largest group of samples, while Grade D has the lowest group of samples, which is only 15. This phenomenon is also reasonable considering the terrible prognosis and lack of proper treatment for colorectal cancer in its late days. Throughout the whole survey, there are in total 262 recorded death cases and the median survival time length by day unit is 1161.1.

    To reconstruct dynamic GRNs during CRC progression, we focus on cancer progression-related genes that should be functionally important and associated to the survival of patients. So, we used Cox PH model in this step to select the survival-associated genes for downstream analysis. The log-rank test p-value was calculated to assess the significance of the association between a given gene and survival of patients. We selected 1868 genes with p-value less than 0.01 for the downstream analysis, to reduce probability of Type I error.

    We next used Gene Ontology to perform gene set enrichment analysis [20]. The most common method for GO enrichment analysis is the Fisher Exact Test, where contingency tables are made for each GO terms about the number of genes have or don’t have the annotation and whether or not the genes are considered significant. 1868 survival-associated genes (log-rank p value < 0.01) were subject to gene set enrichment analysis. ‘topGO’ R package was used to perform the GO enrichment analysis and the top 10 GO functions are selected in Table 2, ranked by Kolmogorov-Smirnov test with more conservative ‘elim’ algorithm, which will render a more confident enrichment analysis result. From Table 2 we can see that the most enriched GO terms are closely related to the regulation of ubiquitin-protein, cell division, DNA elongation and replication and protein transportation, which have been verified to be closely related to cancer progression. For example, the deregulated ubiquitin ligases (E3s) have been experimentally validated to play important roles in tumor progression and cancer drug resistance [21]. For another example, protein trafficking has been shown to facilitate invasion and metastasis of CRC cells, and impaired transportation of member receptors represents important mechanisms of resistance to CRC-targeted therapy [22]. Furthermore, cell division and DNA elongation and replication are well-known biological processes associated with progression of many cancers including CRC [23].

    Table 2.  GO Enrichment Analysis Result.
    GO.ID Annotated Significant Expected KS test p value Term
    GO: 0051436 111 12 9.1 3.90 × 10−5 negative regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle
    GO: 0051437 120 13 9.83 8.80 × 10−5 positive regulation of ubiquitin-protein ligase activity involved in regulation of mitotic cell cycle transition
    GO: 0051301 814 72 66.71 1.10 × 10−4 cell division
    GO: 0006273 13 4 1.07 1.30 × 10−4 lagging strand elongation
    GO: 0045740 104 18 8.52 1.30 × 10−4 positive regulation of DNA replication
    GO: 0030049 59 7 4.84 1.30 × 10−4 muscle filament sliding
    GO: 0035999 14 4 1.15 1.40 × 10−5 tetrahydrofolate interconversion
    GO: 0009157 14 3 1.15 1.60 × 10−5 deoxyribonucleoside monophosphate biosynthetic process
    GO: 0032596 12 3 0.98 1.80 × 10−5 protein transport into membrane raft
    GO: 0031145 111 12 9.1 3.90 × 10−5 anaphase-promoting complex-dependent catabolic process

     | Show Table
    DownLoad: CSV

    To select genes with known direct interaction or indirect associations and narrow the range to select both statistically and biologically significant genes in colorectal cancer, we used STRING database (https://string-db.org/) to construct a PPI network using 1868 genes identified in the previous step as input. The PPI link tsv file is downloaded and 249 unique genes are selected in consideration of potential biological meanings.

    Because of the multiple Affymetrix IDs for the same gene symbol, we here have more than 249 genes selected out of the merged gene expression matrix. Eventually, a matrix used to plot heatmap is selected with 539 genes as rows and 529 patients as columns. The heatmap (Figure 1) shows that these genes are largely correlated with living status, which is shown in the first row of column side colors (green for live and red for death). Many living patients are associated with the higher expression level of these 539 selected genes.

    Figure 1.  Heatmap plot of gene expression (539 genes of 529 patients).

    Using 249 genes selected from the previous STRING PPI network, we calculated mutual information for all pairs of genes and used the above-mentioned DPI algorithm to exclude indirectly associated gene pairs. In order to demonstrate the network structure, the Fruchterman-Reingold algorithm is used to automatically display the network as implemented in igraph R package. Because of the relatively large number of genes in our network, it’s inappropriate to show them all together without a certain difference, hence we modify the text size, edge width and node size in the plot to help better visualize the weight of each gene. The text size is set to be proportionate to the ratio of node degree over maximum node degree of the network; edge width is set to be proportionate to the ratio of log transformed edge weight (which is also the mutual information value between gene pairs) over the log transformed maximum edge weight; and the node size is set to be the ratio of the sum of the mutual information value between one gene and all the other genes, over the maximum number of it.

    The visualization of the constructed mutual information network is shown in Figure 2. We also examined the degree distribution (Figure 3a) and cumulative degree distribution (Figure 3b) for the mutual information network. Most genes (about 45%) have only one connection, verifying the sparsity of the constructed MI network using ARACNE method and DPI algorithm. High-degree genes in gene networks are often considered as hub genes or potential key genes which may possess important biological functions. In our analysis, we found that gene CCT7 does not only has the largest text size but also it has the node size (Figure 2), indicating that CCT7 has both the largest connection with other genes and the sum of mutual information value. Importantly, CCT7 has been experimentally validated to be closely associated with some cancers, such as endometrial carcinoma [24]. This generates hypothesis of CRC as a key gene in CRC for experimental tests.

    Figure 2.  Mutual information network for the selected 249 genes.
    Figure 3.  Node degree distribution of mutual information network. (A) Separate degree distribution. (B) cumulative degree distribution.

    The mutual information network enables us to have a big picture of the gene association within CRC, however, it cannot provide more information about the association direction or the level of such association. To investigate the gene regulatory roadmap underlying CRC, we next infer the directed regulatory networks for each stage of CRC.

    A smaller gene expression matrix (with 127 Affymetrix probes and 529 patients) is extracted from the original matrix (with 22277 Affymetrix probes and 529 patients). The genes are selected based on the following criteria: Survival association significance assessed (genes with log-rank test p-value less than 0.01), PPI interaction information (249 genes in String database) and mutual information network information (genes with the sum of mutual information > 1). The column number of the gene expression matrix corresponds with the phenotype dataset (which has 529 patient entries). Sample-based gene-profiling data of different colorectal cancer grade is created with 61 Grade A, 276 Grade B, 177 Grade C and 15 Grade D patients. The difference in the cancer grade represents the stepwise cancer progression process from early stage to late stage.

    To determine the gene expression evolving direction, a gene evolving trend analysis was conducted. The overall connecting error is used to help to solve this problem. It is defined as the L1-norm of all the individual connecting errors:

    SS=2|x(t(s1)N(s1))x(t(s)1)| (8)

    where t(s)1 and t(s1)N(s1) are the starting times of stage s and the ending time of stage s-1 respectively, x(t(s)1) and x(t(s1)N(s1)) are the corresponding gene expression level of one gene. According to the continuity assumption, x(t(s)1) and x(t(s1)N(s1)) should be the same. However, in practice, there could be error at the connecting point of two stages. By summing up the overall connecting error, we can minimize it in order to get the shortest path from grade A to grade D. Then, this path is used to determine the gene evolving trend within each stage. Figure S2 illustrates the possible combinations between each stage.

    The gene trend analysis is realized in R by using igraph package. First, the grade separated gene-profiling data is generated as large list object, in which every list in the root is the expression profile, and four sublists contain different gene expressions in different cancer stages. Then, quantile function is used to select genes according to a different λ   fraction in a stage. λ is selected to be 0, 0.5, and 1 for interpolation. As for extrapolation, λhead and λtail is set to be both at 4 for better controlling the linearity of the model equations. After that, a bootstrapping method is used to exploit the limited samples. We heuristically set the bootstrap sample size to half of the selected genes, and a total of 6150 bootstrap groups are generated for all the genes, and a total of 30750 corresponding model equations are obtained. The model coefficients in Eq (7) are then estimated by a regularized regression method. In this study we used elastic-net method to guarantee both sparsity and robustness of the estimation. We formulated the following objective function with a penalty to estimate bij(s) for each s,

    min{x(S)i(t(s)λ)+a(S1,S)ix(S1)i(t(s1)λ)jRi(s1)(b(s1,S)ijx(s1)j(t(s1)λ))        jRi(s)(b(s)ijx(s)j(t(s)λ))22+θ[1αela2P1+αelaP2]},s=2,3,4. (9)

    where αela is an elasticnet mixing parameter (between 0 and 1), and θ is the penalty coefficient. P1=i(a(s1,s)i)2+ ij( bij(s1, s))2+  ij( bij(s))2 , and P2=i|a(s1,s)i|+ ij| bij(s1, s)|+  ij| bij(s)| .

    Cross validation strategy is also used to select the optimal θ. After getting the bij(s) matrix, we then convert them to β(s)ij matrix according to previous formula Eq (6). Then a gene regulatory network (GRN) is constructed according to these β(s)ij. The network construction process is repeated 50 times, and ultimately 50 networks are obtained. The confidence of a connection is calculated as occurring frequency among 50 networks, and network connectivity PER is defined as the number of connections reserved in a network.

    A sequence of αela is selected in our model to optimize the sparse gene regulatory network. Confidence-Connectivity plot (C-C plot) is used to demonstrate the relationship between the threshold confidence level of a network and the network connectivity. From the Figure S3, we found that the GRNs of Grade A and C have a similar curve, while Grade B is a little higher in the network confidence level as the network connectivity increases. Grade D curve has a similar descending curve to Grade A and C in the beginning, but then diverts from all the other 3 stages and descends more slowly. Among 15006 possible connections (with direction) for a set of 123 genes in colorectal cancer, most connections have relatively low confidence, in other words, low frequency of occurrence. The connection confidence decreases following an exponential trend as the network connectivity increases. By setting a cutoff value of 30% confidence level, there is no more than 40% connectivity in either grade of the GRN (2615, 4999, 3157 and 4542 connections in Grade A, B, C, and D respectively).

    Four stage-specific gene regulatory network (GRN) are reconstructed with the connectivity setting of 0.02. Four GRNs are then used by Cytoscape (http://www.cytoscape.org) and shown in Figure 4. In these figures, the node size is set to be proportionate to the sum of the products of the confidence and strength over all the direct incoming and outgoing connections. This product stands for the activity of a gene in the network. Whether the strength is for upregulation and downregulation of the target gene is marked with either an arrow (→) or a stop (┤). The level of the orange color of an edge is proportionate to the confidence level of the connection, and the width of the line is set corresponding to the strength of this regulation. The circular layout is ordered by the degree of the nodes.

    Figure 4.  Reconstructed dynamic GRNs for four stages of CRC.

    In the following, we discuss the biological implications of the reconstructed dynamic GRNs for CRC at each stage.

    During CRC stage A (Figure 4a), gene RPS15 has a big positive regulation over RPL37, and the former is negatively regulated by SFSWAP, MAPK4 and other two genes. One known connection with high confidence level is the negative regulation from MYH2 to ACTN2.

    During stage B (Figure 4b), RAPGEF1, ACTN2, MYH2, TGFA, MAPK4, SMG7, CHERP and DYRK1B are potential genes with biological significance. On literature review, RAPGEF1 gene have its first intron under somatic demethylation of a relaxed-criterion CpG island in 40% of colon cancers [25]. ACTN2, as isoform of ACTN4, whose amplifications has been verified to be associated with worse outcomes of cancer patients [26], is also believe to be critical to the outcome of CRC patients. MYH2 was found to be highly up-regulated (with 63-fold change) in CRC patients with tumors of high intrinsic COX-2 expression. TGFA is of the same family of TGF-beta, whose dysregulation has been shown to be closely related to colorectal cancer [27]. CHERP was found in Colorectal Cancer Atlas [28] with functions of negative regulation on cell proliferation.

    In CRC stage C (Figure 4c), there is an interesting pattern of negative regulation from gene BAK1 to several other genes such as RAPGEF1, MYH2. Although BAK1 has relatively small value of product sum (corresponding to gene activity), it has a broad regulation function on multiple genes. Another interesting pattern to notice is that genes with low activity might have a strong regulation effect on other genes, such as RPL37. This is reasonable because important regulatory genes are not always active while their effects could be tremendous.

    Figure 4d shows the stage D-specific regulatory network. The previous observation reoccurred that two low-expressed genes RPL37 and NPM1 have strong influence on the expression of each other as well as being regulated by many other active genes. From this pattern we hypothesize that those low-expressed but influential genes might be the downstream gene regulated by multiple less influential but high-expressed genes. The additive effect of the upstream genes might largely affect the downstream biological process.

    In this study, we employed an integrated approach to analyze clinical transcriptomic data of CRC patients. We are interested at the directional gene regulatory networks for each stage and their dynamic changes during CRC development. Driven by this question, we collected and merged 4 datasets of CRC containing both gene expression data and clinical information. To infer GRNs from the large-scale clinical gene expression dataset, the survival-associated genes were selected as input for network construction, and an initial mutual information network was constructed to reduce the computational cost of the parameter estimation of the dynamical system model of GRN. By introducing a new regularized optimization method into the DCM algorithm, directional GRNs were inferred for each stage of CRC.

    The DCM algorithm [19] employed in this paper relies on two assumptions: The intra-stage steady-rate assumption and the continuity assumption. The intra-stage steady-rate assumption or linear-dynamic assumption, models the dynamic gene expression profile with a linear trend within each stage of a process. The continuity assumption assumes that the gene expression trajectory is continuous both within each disease stage and at the connecting point of two adjacent stages. Obviously, these two assumptions are simplification of the real gene expression in patients. In reality, it’s largely plausible that the gene expression dynamics should be nonlinear, and the gene mutations or other genetic variations may result in discrete transition of gene expression trajectory. Therefore, the linear-dynamic assumption and the continuity assumption need to be generalized or waived to be more realistic.

    In addition, the number of samples in each stage (stage A to D) is uneven in our study. The minimal sample number in that study is 13, which is comparative to 15 in our paper. Although the previous study has shown that the DCM algorithm is not sensitive to the sample size (Figure 3B in [19]), the uneven sample number could be considered for improving the DCM algorithm. For example, this consideration may be addressed by assigning different penalties or weights for different groups according to the number of samples.

    In the current study, we focus on GRNs rewiring during CRC development from stage A to stage D. We did not consider normal samples of CRC since they are not available in the collected dataset, which is another limitation of our study. From the viewpoint of dynamic theory, disease progression can be considered a state transition, from a normal state, to pre-disease state and then to disease state. So, integration of normal samples into dynamic network analysis would reveal some important insights for cancer initiation and occurrence, which will be studied in our future study.

    Inferring gene regulatory network from high throughput data is a fundamental question in systems biology yet remains as a challenging task. Analyzing GRNs and their dynamic rewiring are instrumental to identifying master regulator genes or casual drivers in human disease at systems level. The existing methods for inferring GRNs largely depend on time-course gene expression data. However, large samples of time-course data are rarely available in clinical situations since longitudinal surveys are often challenging to conduct. In contrast, cross-sectional studies of a large population based on high-throughput molecular omics data are more prevalent due to their relative feasibility.

    The lack of temporal information in clinical transcriptomic data leads to a major challenge for inferring GRN and its translation to precision medicine. Therefore, how to decode the temporal information underlying disease progression is an interesting question and may be key to address the above challenge. The sample similarity-based approach in evolution and genetics studies [29], for instance, phylogenetic trees based on microarray data [30] and genetic linkage maps based on genetic markers [31], have shown developed and shown success in recovering evolutionary dynamics. Importantly, some studies have shown that it is possible to measure cancer progression by using gene expression profiles to estimate “distances” between tumors [32]. Another study used graph-based minimum spanning tree to recover biological progression from microarray samples [33], which can be applied for reconstructing temporal trajectory of differentiation, development, cell cycle or disease progression.

    In summary, in this study, we analyzed the dynamic changes of GRNs along different stages of CRC by using an integrated approach. We combined several clinical transcriptomic datasets of CRC and reconstructed GRNs of CRC across different stages by integrating gene selection, mutual information-based network sparcification and a previous DCM algorithm. Our study demonstrated that the dynamic rewiring of gene regulations could be inferred by reconstructing dynamic GRNs based on clinical transcriptomic profiling. Our study revealed several key genes underlying CRC development, which may generate new hypothesis for further experimental test. We anticipate new methods in future studies to quantitatively estimate disease progression based on the clinical cross-sectional transcriptomic data, which may be promising for developing dynamical system-based GRN inference approach.

    The gene expression datasets as well as clinical information of the patients were downloaded from the NCBI GEO database, as described in the main text.

    X Sun was supported by grants from the National Key R & D Program of China (2018YFC0910500), the National Natural Science Foundation of China (11871070), the Guangdong Basic and Applied Basic Research Foundation (2020B1515020047), Guangdong Key Field R & D Plan (2019B020228001)and the Opening Project of Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University (2018003).

    The authors declare that they have no conflict of interest.

    X.S. designed research; A.D. performed research; A.D. and X.S. analyzed data and wrote the paper.



    [1] T. Hirota, R. So, Y. S. Kim, B. Leventhal, R. A. Epstein, A systematic review of screening tools in non-young children and adults for autism spectrum disorder, Res. Dev. Disabil., 80 (2018), 1–12. https://doi.org/10.1016/j.ridd.2018.05.017 doi: 10.1016/j.ridd.2018.05.017
    [2] C. Lord, S. Risi, P.S. Dilavore, C. Shulman, A. Thurm, A. Pickles. Autism from 2 to 9 years of age. Arch. Gen. Psychiatry, 63 (2006), 694–701. https://doi.org/10.1001/archpsyc.63.6.694 doi: 10.1001/archpsyc.63.6.694
    [3] B. B. Sizoo, E. H. Horwitz, J. P. Teunisse, C. C. Kan, C. Vissers, E. Forceville, et al., Predictive validity of self-report questionnaires in the assessment of autism spectrum disorders in adults, Autism, 19 (2015), 842–849. https://doi.org/10.1177/1362361315589869 doi: 10.1177/1362361315589869
    [4] P. O. Towle, P. A. Patrick, Autism Spectrum Disorder Screening Instruments for Very Young Children: A Systematic Review, Autism. Res. Treat., 2016 (2016), 4624829. https://doi.org/10.1155/2016/4624829 doi: 10.1155/2016/4624829
    [5] D. Bone, S. Bishop, M. P. Black, M. S. Goodwin, C. Lord, S. S. Narayanan, Use of machine learning to improve autism screening and diagnostic instruments: effectiveness, efficiency, and multi-Instrument Fusion, J. Child. Psychol. Psychiatry, 57 (2017), 927–937. https://doi.org/10.1111/jcpp.12559 doi: 10.1111/jcpp.12559
    [6] J. A. Kosmicki, V. Sochat, M. Duda, D. P. Wall, Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning, Transl. Psychiatry, 5 (2015), 514–517. https://doi.org/10.1038/tp.2015.7 doi: 10.1038/tp.2015.7
    [7] F. Thabtah, Machine learning in autistic spectrum disorder behavioral research: A review and ways forward, Informatics Heal. Soc. Care, 44 (2019), 278–297. https://doi.org/10.1080/17538157.2017.1399132 doi: 10.1080/17538157.2017.1399132
    [8] D. H. Oh, I. B. Kim, S. H. Kim, D. H. Ahn, Predicting autism spectrum disorder using blood-based gene expression signatures and machine learning, Clin. Psychopharmacol. Neurosci., 15 (2017), 47–52. https://doi.org/10.9758/cpn.2017.15.1.47 doi: 10.9758/cpn.2017.15.1.47
    [9] M. Duda, R. Ma, N. Haber, D. P. Wall, Use of machine learning for behavioral distinction of autism and ADHD, Transl. Psychiatry, 6 (2016), 732. https://doi.org/10.1038/tp.2015.221 doi: 10.1038/tp.2015.221
    [10] G. Li, O. Lee, H. Rabitz, High-efficiency classification of children with autism spectrum disorder, PLoS One, 13 (2018), 1–23. https://doi.org/10.1371/journal.pone.0192867 doi: 10.1371/journal.pone.0192867
    [11] Q. Tariq, S. L. Fleming, J. N. Schwartz, K. Dunlap, C. Corbin, P. Washington, et al., Detecting Developmental Delay and Autism Through Machine Learning Models Using Home Videos of Bangladeshi Children: Development and Validation Study, J. Med. Internet Res., 21 (2019), 13822. https://doi.org/10.2196/13822 doi: 10.2196/13822
    [12] D. Eman, W. R. Emanuel, Machine Learning Classifiers for Autism Spectrum Disorder: A Review, 2019 4th Int. Conf. Inform. Technol. Inform. Syst. Electr. Eng. (ICITISEE), Yogyakarta, Indonesia, 2019. https://doi.org/10.1109/ICITISEE48480.2019.9003807
    [13] X. Bi, Y. Wang, Q. Shu, Q. Sun, Q. Xu, Classification of autism spectrum disorder using random support vector machine cluster, Frontiers in Genetics, 6 (2018), 9–18. https://doi.org/10.3389/fgene.2018.00018 doi: 10.3389/fgene.2018.00018
    [14] E. Grossi, C. Olivieri, M. Buscema, Diagnosis of autism through EEG processed by advanced computational algorithms: a pilot study, Comput. Methods Programs Biomed., 142 (2017), 73–79. https://doi.org/10.1016/j.cmpb.2017.02.002 doi: 10.1016/j.cmpb.2017.02.002
    [15] M. L. Raja, M. Priya, Neural network based classification of EEG signals for diagnosis of autism spectrum disorder, Int. J. Pharm. Bio. Sci., 8 (2017), 1020–1026.
    [16] L. Raja, M. M. Priyab, EEG based ASD diagnosis for children using auto-regressive features and FFNN, Int. J. Control Theo. App., 10 (2017), 27–32.
    [17] L. Raja, M. M. Priya, EEG based diagnosis of autism spectrum disorder using static and dynamic neural networks, ARPN J. Eng. Appl. Sci., 12 (2017), 4653787.
    [18] R. Djemal, K. AlSharabi, S. Ibrahim, A. Alsuwailem, EEG-based computer aided diagnosis of autism spectrum disorder using wavelet, entropy, and ANN, BioMed. Res. Int., 2017 (2017), 1–9. https://doi.org/10.1155/2017/9816591 doi: 10.1155/2017/9816591
    [19] T. M. Heunis, C. Aldrich, P. J. Vries, Recent Advances in Resting-State Electroencephalography Biomarkers for Autism Spectrum Disorder-A Review of Methodological and Clinical Challenges, Rev. Pediatr. Neurol., 61 (2016), 28–37. https://doi.org/10.1016/j.pediatrneurol.2016.03.010 doi: 10.1016/j.pediatrneurol.2016.03.010
    [20] N. P. Jordanova, J. P. Jordanov, Spectrum-weighted EEG frequency ("brain-rate") as a quantitative indicator of mental arousal. Prilozi, 26 (2005), 35–42.
    [21] E. Abdulhay, M. Alafeef, A. Abdelhay, A. Al-Bashir, Classification of Normal, Ictal and Inter-ictal EEG via Direct Quadrature and Random Forest Tree, J. Med. Biol. Eng., 37 (2017), 843–857. https://doi.org/10.1007/s40846-017-0239-z doi: 10.1007/s40846-017-0239-z
    [22] Z. Dandan, D. Haiyan, H. Xinlin, L. Yunfeng, Z. Congle, Y. Datian, The Combination of Amplitude and Sample Entropy in EEG and its Application to Assessment of Cerebral Injuries in Piglets, 2008 Int. Conf. BioMed. Eng. Informatics, Sanya, China, 2008. https://doi.org/10.1109/BMEI.2008.12
    [23] E. Abdulhay, M. Alafeef, L. Alzghoul, M. Al Momani, R. Al Abdi, N. Arunkumar, et al., Computer-aided autism diagnosis via second-order difference plot area applied to EEG empirical mode decomposition, Neural Comput. Appl., 32 (2020), 10947–10956. https://doi.org/10.1007/s00521-018-3738-0 doi: 10.1007/s00521-018-3738-0
    [24] R. J. Oweis, E. W. Abdulhay, Seizure classification in EEG signals utilizing Hilbert-Huang transform, Biomed. Eng. Online, 10 (2011), 38. https://doi.org/10.1186/1475-925X-10-38 doi: 10.1186/1475-925X-10-38
    [25] E. Abdulhay, M. Alafeef, H. Hadoush, N. Alomari, M. Bashayreh, Frequency 3D Mapping and Inter-Channel Stability of EEG Intrinsic Function Pulsation: Indicators Towards Autism Spectrum Diagnosis, 2017 10th Jordanian Int. Electric. Electron. Eng. Conf. (JIEEEC), Amman, Jordan, 2017. https://doi.org/10.1109/JIEEEC.2017.8051416
    [26] H. Hadoush, M. Alafeef, E. Abdulhay, Automated identification for autism severity level: EEG analysis using empirical mode decomposition and second order difference plot, Behavioural Brain Res., 362 (2019), 240–248. https://doi.org/10.1016/j.bbr.2019.01.018 doi: 10.1016/j.bbr.2019.01.018
    [27] E. Abdulhay, V. Elamaran, M. Chandrasekar, V. S. Balaji, and K. Narasimhan, Automated diagnosis of epilepsy from EEG signals using ensemble learning approach, Pattern Recognition Letters, 139 (2020), 174–181. https://doi.org/10.1016/j.patrec.2017.05.021 doi: 10.1016/j.patrec.2017.05.021
    [28] T. H. Pham, J. Vicnesh, J. K. Wei, S. J. Oh, N. Arunkumar, E. Abdulhay, et al., Autism spectrum disorder diagnostic system using HOS bispectrum with EEG signals, Int. J. Environ. Res. Public Health, 17 (2020), 1–14. https://doi.org/10.3390/ijerph17030971 doi: 10.3390/ijerph17030971
    [29] W. Bosl, A. Tierney, H. T. Flusberg, C. Nelson, EEG complexity as a biomarker for autism spectrum disorder risk, BMC Med., 9 (2011), 18. https://doi.org/10.1186/1741-7015-9-18 doi: 10.1186/1741-7015-9-18
    [30] F. H. Duffy, A. Heidelise, Autism, spectrum or clusters? An EEG coherence study, BMC Neurol., 19 (2019), 27. https://doi.org/10.1186/s12883-019-1254-1 doi: 10.1186/s12883-019-1254-1
    [31] A. Sheikhani, H. Behnam, M. R. Mohammadi, M. Noroozian, Analysis of EEG background activity in Autsim disease patients with bispectrum and STFT measure, Proceedings of the 11th WSEAS Int. Conf. Commun., Agios Nikolaos, Greece, 2007.
    [32] J. Kang, H. Chen, X. Li, X. Li, EEG entropy analysis in autistic children, J. Clin. Neurosci., 62 (2019), 199–206. https://doi.org/10.1016/j.jocn.2018.11.027 doi: 10.1016/j.jocn.2018.11.027
    [33] L. Billeci, F. Sicca, K. Maharatna, F. Apicella, A. Narzisi, G. Campatelli, et al., On the application of quantitative EEG for characterizing autistic brain: a systematic review, Front. Hum. Neurosci., 7 (2013), 442. https://doi.org/10.3389/fnhum.2013.00442 doi: 10.3389/fnhum.2013.00442
    [34] M. Ahmadlou, H. Adeli, A. Adeli, Fractality and a wavelet-chaos-neural network methodology for EEG-based diagnosis of autistic spectrum disorder, J. Clin. Neurophysiol., 27 (2010), 328–333. https://doi.org/10.1097/WNP.0b013e3181f40dc8 doi: 10.1097/WNP.0b013e3181f40dc8
    [35] B. B. Mandelbrot, The Fractal Geometry of Nature. New York: Freeman and Company (1977), 1–468.
    [36] M. Costa, A. L. Goldberger, C. K. Peng, Multiscale entropy analysis of biological signals. Phys. Rev. E., 71 (2005), 021906. https://doi.org/10.1103/PhysRevE.71.021906 doi: 10.1103/PhysRevE.71.021906
    [37] A. Namdari, Z. Li, A review of entropy measures for uncertainty quantification of stochastic processes, Adv. Mechanical Eng., 11 (2019), 1–14. https://doi.org/10.1177/1687814019857350 doi: 10.1177/1687814019857350
    [38] H. Hadoush, M. Alafeef, E. Abdulhay, Brain complexity in children with mild and severe autism spectrum disorders: analysis of multiscale entropy in EEG, Brain Topography, 32 (2019), 914–921. https://doi.org/10.1007/s10548-019-00711-1 doi: 10.1007/s10548-019-00711-1
    [39] Y. Ghanbari, L. Bloy, J. C. Edgar, L. Blaskey, R. Verma, T. P. Roberts, Joint analysis of band-specific functional connectivity and signal complexity in autism, J. Autism Dev. Disord., 45 (2015), 444–460. https://doi.org/10.1007/s10803-013-1915-7 doi: 10.1007/s10803-013-1915-7
    [40] T. Liu, Y. Chen, D. Chen, C. Li, Y. Qiu, J. Wang, Altered electroencephalogram complexity in autistic children shown by the multiscale entropy approach, Neuro. Report, 28 (2017), 169–173. https://doi.org/10.1097/WNR.0000000000000724 doi: 10.1097/WNR.0000000000000724
    [41] J. O. Maximo, D. L. Murdaugh, R. K. Kana, Alterations in Brain Entropy in Autism Spectrum Disorders, 2017 Int. Meet. Autism Res., Birmingham, USA, 2017.
    [42] J. Q. Kosciessa, N. A. Kloosterman, D. D. Garrett, Standard multiscale entropy reflects neural dynamics at mismatched temporal scales: What's signal irregularity got to do with it?, PLOS Comput. Biol., 16 (2020), e1007885. https://doi.org/10.1371/journal.pcbi.1007885 doi: 10.1371/journal.pcbi.1007885
    [43] A. Catarino, O. Churches, S. B. Cohen, A. Andrade, H. Ring, Atypical EEG complexity in autism spectrum conditions: a multiscale, entropy analysis, Clin. Neurophysiol., 122 (2011), 2375–2383. https://doi.org/10.1016/j.clinph.2011.05.004 doi: 10.1016/j.clinph.2011.05.004
    [44] J. S. Richman, J. R. Moorman, Physiological time-series analysis using approximate entropy and sample entropy, Am. J. Physiol. Heart Circ. Physiol., 278 (2000), H2039–49. https://doi.org/10.1152/ajpheart.2000.278.6.H2039 doi: 10.1152/ajpheart.2000.278.6.H2039
    [45] R. Ferenets, T. Lipping, A. Anier, V. Jantti, S. Melto, S. Hovilehto, Comparison of entropy and complexity measures for the assessment of depth of sedation, IEEE Trans. Biomed. Eng., 53 (2006), 1067–1077. https://doi.org/10.1109/TBME.2006.873543 doi: 10.1109/TBME.2006.873543
    [46] A. H. Heurtier, The Multiscale Entropy Algorithm and Its Variants: A Review, Entropy, 17 (2015), 3110–3123. https://doi.org/10.3390/e17053110 doi: 10.3390/e17053110
    [47] H. Azami and J. Escudero, Amplitude- and Fluctuation-Based Dispersion Entropy, Entropy, 20 (2018), 210. https://doi.org/10.3390/e20030210 doi: 10.3390/e20030210
    [48] J. F. Valencia, A. Porta, M. Vallverdu, F. Claria, R. Baranowski, E. O. Baranowska, et al., Refined multiscale entropy: Application to 24-h Holter recordings of heart period variability in healthy and aortic stenosis subjects, IEEE Trans. Biomed., 56 (2009), 2202–2213. https://doi.org/10.1109/TBME.2009.2021986 doi: 10.1109/TBME.2009.2021986
    [49] J. F. Valencia, M. Vallverdu, R. Schroeder, L. Cygankiewicz, R. Vazquez, A. B. Luna, et al., Heart rate variability characterized by refined multiscale entropy applied to cardiac death in ischemic cardiomyopathy patients, Comput. Cardiol., 37 (2010), 65–68.
    [50] W. J. Bosl, T. Loddenkemper, C. A. Nelson, Nonlinear EEG biomarker profiles for autism and absence epilepsy, Neuropsychiatric Electrophysiology, 3 (2017), 1. https://doi.org/10.1186/s40810-017-0023-x doi: 10.1186/s40810-017-0023-x
    [51] W. J. Bosl, H. T. Flusberg, C. A. Nelson, EEG Analytics for Early Detection of Autism Spectrum Disorder: A data-driven approach, Sci. Rep., 8 (2018), 6828. https://doi.org/10.1038/s41598-018-24318-x doi: 10.1038/s41598-018-24318-x
    [52] S. D. Wu, C.W. Wu, K.Y. Lee, S. G. Lin, Modified multiscale entropy for short-term time series analysis, Physica A, 392 (2013), 15865–5873. https://doi.org/10.1016/j.physa.2013.07.075 doi: 10.1016/j.physa.2013.07.075
    [53] S. D. Wu, C. W. Wu, S. G. Lin, C. C. Wang, K. Y. Lee, Time series analysis using composite multiscale entropy, Entropy, 15 (2013), 1069–1084. https://doi.org/10.3390/e15031069 doi: 10.3390/e15031069
    [54] S. D. Wu, C. W. Wu, S. G. Lin, K. Y. Lee, C. K. Peng, Analysis of complex time series using refined composite multiscale entropy, Phys. Lett. A, 378 (2014), 1369–1374. https://doi.org/10.1016/j.physleta.2014.03.034 doi: 10.1016/j.physleta.2014.03.034
    [55] S. D. Wu, C. W. Wu, K. Y. Lee, S. G. Lin, Modified multiscale entropy for short-term time series analysis, Phys. A, 392 (2013), 5865–5873. https://doi.org/10.1016/j.physa.2013.07.075 doi: 10.1016/j.physa.2013.07.075
    [56] Y. C. Chang, H. T. Wu, H. R. Chen, A. B. Liu, J. J. Yeh, M. T. Lo, et al., Application of a modified entropy computational method in assessing the complexity of pulse wave velocity signals in healthy and diabetic subjects, Entropy, 16 (2014), 4032–4043. https://doi.org/10.3390/e16074032 doi: 10.3390/e16074032
    [57] Y. Jiang, C. K. Peng, Y. Xu, Hierarchical entropy analysis for biological signals, J. Comput. Appl. Math., 236 (2011), 728–742. https://doi.org/10.1016/j.cam.2011.06.007 doi: 10.1016/j.cam.2011.06.007
    [58] H. B. Xie, W. X. He, H. Liu, Measuring time series regularity using nonlinear similarity-based sample entropy, Phys. Lett. A, 372 (2008), 7140–7146. https://doi.org/10.1016/j.physleta.2008.10.049 doi: 10.1016/j.physleta.2008.10.049
    [59] M. U. Ahmed, D. P. Mandic, Multivariate multiscale entropy analysis, IEEE Signal Process. Lett., 19 (2012), 91–94. https://doi.org/10.1109/LSP.2011.2180713 doi: 10.1109/LSP.2011.2180713
    [60] M. D. Costa, A. L. Goldberger, Generalized multiscale entropy analysis: Application to quantifying the complex volatility of human heartbeat time series, Entropy, 17 (2015), 1197–1203. https://doi.org/10.3390/e17031197 doi: 10.3390/e17031197
    [61] L. Faes, A. Porta, M. Javorka, G. Nollo, Efficient Computation of Multiscale Entropy over Short Biomedical Time Series Based on Linear State-Space Models, Complexity, 2017 (2017), 1768264. https://doi.org/10.1155/2017/1768264 doi: 10.1155/2017/1768264
    [62] T. Takahashi, Complexity of spontaneous brain activity in mental disorders, Prog. Neuropsychopharmacol. Biol. Psychiatry, 45 (2013), 258–266. https://doi.org/10.1016/j.pnpbp.2012.05.001 doi: 10.1016/j.pnpbp.2012.05.001
    [63] N. Huang, Z. Shen, S. Long, M. Wu, H. H. Shih, Q. Zheng, et al., The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis, Proc. Math. Phys. Eng. Sci., 454 (1998), 903–995. https://doi.org/10.1098/rspa.1998.0193 doi: 10.1098/rspa.1998.0193
    [64] N. E. Huang, Z. Wu, A review on Hilbert-Huang transform: Method and its applications to geophysical studies, Rev. Geophys., 46 (2008), 228–251. https://doi.org/10.1029/2007RG000228 doi: 10.1029/2007RG000228
    [65] F. R. Kschischang, The Hilbert Transform. Toronto: University of Toronto, 2006.
    [66] E. Abdulhay, P.Y. Guméry, J. Fontecave, P. Baconnier, Cardiogenic oscillations extraction in inductive plethysmography: Ensemble empirical mode decomposition, Annu. Int. Conf. IEEE Eng. Med. Biol. Soc., Minnesota, USA, 2009, 2240–2243. https://doi.org/10.1109/IEMBS.2009.5335004
    [67] X. Han, J. Peng, A. Cui, F. Zhao, Sparse Principal Component Analysis via Fractional Function Regularity, Math. Probl. Eng., 2020 (2020), 7874140. https://doi.org/10.1155/2020/7874140 doi: 10.1155/2020/7874140
    [68] C. K. Arthur, V. A. Temeng, Y. Y. Ziggah, Performance Evaluation of Training Algorithms in Backpropagation Neural Network Approach to Blast-Induced Ground Vibration Prediction, Ghana Mining J., 20 (2020), 20–33. https://doi.org/10.4314/gm.v20i1.3 doi: 10.4314/gm.v20i1.3
    [69] K. Kovarski, J. Malvy, R. K. Khanna, S. Arsène, M. Batty, M. Latinus, Reduced visual evoked potential amplitude in autism spectrum disorder, a variability effect?, Translational Psychiatry, 9 (2019), 341. https://doi.org/10.1038/s41398-019-0672-6 doi: 10.1038/s41398-019-0672-6
    [70] S. A. Nastase, V. Iacovella, B. Davis, U. Hasson, Connectivity in the human brain dissociates entropy and complexity of auditory inputs, NeuroImage, 31 (2015), 292–300. https://doi.org/10.1016/j.neuroimage.2014.12.048 doi: 10.1016/j.neuroimage.2014.12.048
    [71] P. Barttfeld, B. Wicker, S. Cukier, S. Navarta, S. Lew, M. Sigman, A big-world network in ASD: dynamical connectivity analysis reflects a deficit in longrange connections and an excess of short-range connections, Neuropsychologia, 49 (2015), 254–263. https://doi.org/10.1016/j.neuropsychologia.2010.11.024 doi: 10.1016/j.neuropsychologia.2010.11.024
    [72] H. Zhang, R. Li, X. Wen, Q. Li, X. Wu, Altered Time-Frequency Feature in Default Mode Network of Autism Based on Improved Hilbert-Huang Transform, IEEE J. Biomed. Health Informatics, 25 (2021), 485–492. https://doi.org/10.1109/JBHI.2020.2993109 doi: 10.1109/JBHI.2020.2993109
    [73] T. Wadhera, D. Kakkar, Conditional entropy approach to analyze cognitive dynamics in autism spectrum disorder, Neurol. Res., 42 (2020), 869–878. https://doi.org/10.1080/01616412.2020.1788844 doi: 10.1080/01616412.2020.1788844
    [74] E. Gani, N. Handayani, S. H. Pratama, N. Afif, F. Aziezah, A. C. Keintjem, et al., Brainwaves Analysis Using Spectral Entropy in Children with Autism Spectrum Disorders (ASD), J. phys. Conf. ser., 1505 (2020), 012070. https://doi.org/10.1088/1742-6596/1505/1/012070 doi: 10.1088/1742-6596/1505/1/012070
    [75] E. Amiot, Entropy of Fourier coefficients of periodic musical objects, J. Math. Music, 15 (2021), 235–246. https://doi.org/10.1080/17459737.2020.1777592 doi: 10.1080/17459737.2020.1777592
    [76] D. Abásolo, R. Hornero, P. Espino, D. Alvarez, J. Poza, Entropy analysis of the EEG background activity in Alzheimer's disease patients, Physiol. Meas., 27 (2006), 241–253. https://doi.org/10.1088/0967-3334/27/3/003 doi: 10.1088/0967-3334/27/3/003
    [77] J. Han, Y. Li, J. Kang, E. Cai, Z. Tong, G. Ouyang, et al., Global Synchronization of Multichannel EEG Based on Rényi Entropy in Children with Autism Spectrum Disorder, Appl. Sci., 7 (2017), 257. https://doi.org/10.3390/app7030257 doi: 10.3390/app7030257
    [78] E. Abdulhay, M. Alafeef, H. Hadoush, N. Arunkumar, Resting State EEG-based Diagnosis of Autism via Elliptic Area of Continuous Wavelet Transform Complex Plot, J. Intell. fuzzy syst., 39 (2020), 8599–8607. https://doi.org/10.3233/JIFS-189176 doi: 10.3233/JIFS-189176
    [79] R. Okazaki, T. Takahashi, K. Ueno, K. Takahashi, M. Ishitobi, M. Kikuchi, et al., Changes in EEG complexity with electroconvulsive therapy in a patient with autism spectrum disorders: a multiscale entropy approach, Front. Hum. Neurosci., 9 (2015), 25767444. https://doi.org/10.3389/fnhum.2015.00106 doi: 10.3389/fnhum.2015.00106
    [80] S. Thapaliya, S. Jayarathna, M. Jaime, Evaluating the EEG and eye movements for autism spectrum disorder, 2018 IEEE Int. Conf. Big Data, Seattle, WA, USA, 2018. https://doi.org/10.1109/BigData.2018.8622501
    [81] J. Eldridge, A. E. Lane, M. Belkin, S. Dennis, Robust features for the automatic identification of autism spectrum disorder in children, J. Neurodev. Disord., 6 (2014), 1–12. https://doi.org/10.1186/1866-1955-6-12 doi: 10.1186/1866-1955-6-1
    [82] H. Amoud, H. Snoussi, D. Hewson, M. Doussot, J. Duchêne, Intrinsic mode entropy for nonlinear discriminant analysis, IEEE Signal Process. Lett., 14 (2007), 297–300. https://doi.org/10.1109/LSP.2006.888089 doi: 10.1109/LSP.2006.888089
    [83] M. Hu, H. Liang, Adaptive multiscale entropy analysis of multivariate neural data, IEEE Trans. Biomed. Eng., 59 (2012), 12–15. https://doi.org/10.1109/TBME.2011.2162511 doi: 10.1109/TBME.2011.2162511
    [84] O. Dekhil, M. Ali, Y. E. Nakeib, A. Shalaby, A. Soliman, A. Switala, et.al., A Personalized Autism Diagnosis CAD System Using a Fusion of Structural MRI and Resting-State Functional MRI Data. Front. Psychiatry, 10 (2021), 1–16. https://doi.org/10.3389/fpsyt.2019.00392 doi: 10.3389/fpsyt.2019.00392
    [85] O. Dekhil, M. Ali, R. Haweel, Y. Elnakeib, M. Ghazal, H. Hajjdiab, et.al. A Comprehensive Framework for Differentiating Autism Spectrum Disorder From Neurotypicals by Fusing Structural MRI and Resting State Functional MRI, Seminars in Pediatric Neurology., 34 (2020), 100805. https://doi.org/10.1016/j.spen.2020.100805 doi: 10.1016/j.spen.2020.100805
    [86] K. Barik, K. Watanabe, J. Bhattacharya, G. Saha, Classification of Autism in Young Children by Phase Angle Clustering in Magnetoencephalogram Signals, 2020 National Conf. Commun. (NCC), Kharagpur, India, 2020, 1–6. https://doi.org/10.1109/NCC48643.2020.9056022
  • This article has been cited by:

    1. Suhas A. Bhyratae, Neha Mangla, DESIGN AND DEVELOPMENT OF ADVANCED SIMILARITY MEASURE FOR RECONSTRUCTING GRN USING mRNA EXPRESSION PROFILES, 2022, 34, 1016-2372, 10.4015/S1016237222500247
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(3556) PDF downloads(167) Cited by(3)

Figures and Tables

Figures(7)  /  Tables(5)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog