Research article Special Issues

Intelligent immune clonal optimization algorithm for pulmonary nodule classification


  • Computer-aided diagnosis (CAD) of pulmonary nodules is an effective approach for early detection of lung cancers, and pulmonary nodule classification is one of the key issues in the CAD system. However, CAD has the problems of low accuracy and high false-positive rate (FPR) on pulmonary nodule classification. To solve these problems, a novel method using intelligent immune clonal selection and classification algorithm is proposed and developed in this work. First, according to the mechanism and characteristics of chaotic motion with a logistic mapping, the proposed method utilizes the characteristics of chaotic motion and selects the control factor of the optimal chaotic state, to generate an initial population with randomness and ergodicity. The singleness problem of the initial population of the immune algorithm was solved by the proposed method. Second, considering on the characteristics of Gaussian mutation operator (GMO) with a small scale, and Cauchy mutation operator (CMO) with a big scale, an intelligent mutation strategy is developed, and a novel control factor of the mutation is designed. Therefore, a Gauss-Cauchy hybrid mutation operator is designed. Ultimately, in this study, the intelligent immune clonal optimization algorithm is proposed and developed for pulmonary nodule classification. To verify its accuracy, the proposed method was used to analyze 90 CT scans with 652 nodules. The experimental results revealed that the proposed method had an accuracy of 97.87% and produced 1.52 false positives per scan (FPs/scan), indicating that the proposed method has high accuracy and low FPR.

    Citation: Qi Mao, Shuguang Zhao, Lijia Ren, Zhiwei Li, Dongbing Tong, Xing Yuan, Haibo Li. Intelligent immune clonal optimization algorithm for pulmonary nodule classification[J]. Mathematical Biosciences and Engineering, 2021, 18(4): 4146-4161. doi: 10.3934/mbe.2021208

    Related Papers:

    [1] Ziyu Jin, Ning Li . Diagnosis of each main coronary artery stenosis based on whale optimization algorithm and stacking model. Mathematical Biosciences and Engineering, 2022, 19(5): 4568-4591. doi: 10.3934/mbe.2022211
    [2] Chengkang Li, Ran Wei, Yishen Mao, Yi Guo, Ji Li, Yuanyuan Wang . Computer-aided differentiates benign from malignant IPMN and MCN with a novel feature selection algorithm. Mathematical Biosciences and Engineering, 2021, 18(4): 4743-4760. doi: 10.3934/mbe.2021241
    [3] Jun Gao, Qian Jiang, Bo Zhou, Daozheng Chen . Convolutional neural networks for computer-aided detection or diagnosis in medical image analysis: An overview. Mathematical Biosciences and Engineering, 2019, 16(6): 6536-6561. doi: 10.3934/mbe.2019326
    [4] Guanyu Wang . The Effects of Affinity Mediated Clonal Expansion of Premigrant Thymocytes on the Periphery T-Cell Repertoire. Mathematical Biosciences and Engineering, 2005, 2(1): 153-168. doi: 10.3934/mbe.2005.2.153
    [5] Yingjian Yang, Wei Li, Yingwei Guo, Nanrong Zeng, Shicong Wang, Ziran Chen, Yang Liu, Huai Chen, Wenxin Duan, Xian Li, Wei Zhao, Rongchang Chen, Yan Kang . Lung radiomics features for characterizing and classifying COPD stage based on feature combination strategy and multi-layer perceptron classifier. Mathematical Biosciences and Engineering, 2022, 19(8): 7826-7855. doi: 10.3934/mbe.2022366
    [6] Jiahao Zhang, Zhengming Gao, Suruo Li, Juan Zhao, Wenguang Song . Improved intelligent clonal optimizer based on adaptive parameter strategy. Mathematical Biosciences and Engineering, 2022, 19(10): 10275-10315. doi: 10.3934/mbe.2022481
    [7] Jingyao Liu, Qinghe Feng, Yu Miao, Wei He, Weili Shi, Zhengang Jiang . COVID-19 disease identification network based on weakly supervised feature selection. Mathematical Biosciences and Engineering, 2023, 20(5): 9327-9348. doi: 10.3934/mbe.2023409
    [8] Xiwen Qin, Shuang Zhang, Dongmei Yin, Dongxue Chen, Xiaogang Dong . Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm. Mathematical Biosciences and Engineering, 2022, 19(12): 13747-13781. doi: 10.3934/mbe.2022641
    [9] Binju Saju, Neethu Tressa, Rajesh Kumar Dhanaraj, Sumegh Tharewal, Jincy Chundamannil Mathew, Danilo Pelusi . Effective multi-class lungdisease classification using the hybridfeature engineering mechanism. Mathematical Biosciences and Engineering, 2023, 20(11): 20245-20273. doi: 10.3934/mbe.2023896
    [10] Kun Lan, Jianzhen Cheng, Jinyun Jiang, Xiaoliang Jiang, Qile Zhang . Modified UNet++ with atrous spatial pyramid pooling for blood cell image segmentation. Mathematical Biosciences and Engineering, 2023, 20(1): 1420-1433. doi: 10.3934/mbe.2023064
  • Computer-aided diagnosis (CAD) of pulmonary nodules is an effective approach for early detection of lung cancers, and pulmonary nodule classification is one of the key issues in the CAD system. However, CAD has the problems of low accuracy and high false-positive rate (FPR) on pulmonary nodule classification. To solve these problems, a novel method using intelligent immune clonal selection and classification algorithm is proposed and developed in this work. First, according to the mechanism and characteristics of chaotic motion with a logistic mapping, the proposed method utilizes the characteristics of chaotic motion and selects the control factor of the optimal chaotic state, to generate an initial population with randomness and ergodicity. The singleness problem of the initial population of the immune algorithm was solved by the proposed method. Second, considering on the characteristics of Gaussian mutation operator (GMO) with a small scale, and Cauchy mutation operator (CMO) with a big scale, an intelligent mutation strategy is developed, and a novel control factor of the mutation is designed. Therefore, a Gauss-Cauchy hybrid mutation operator is designed. Ultimately, in this study, the intelligent immune clonal optimization algorithm is proposed and developed for pulmonary nodule classification. To verify its accuracy, the proposed method was used to analyze 90 CT scans with 652 nodules. The experimental results revealed that the proposed method had an accuracy of 97.87% and produced 1.52 false positives per scan (FPs/scan), indicating that the proposed method has high accuracy and low FPR.



    Lung cancer is one of the highest rates of morbidity and mortality in patients among the worldwide, and an effective approach for lung nodule detection and classification is very important for the early computer-aided diagnosis of lung cancer [1,2]. In this regard, the methods based on artificial intelligence have been proposed and developed, such as artificial neural network [3], evolutionary computation [4], and artificial immune system (AIS) [5]. Notably, AIS is inspired by human immune system, which has the characteristics of adaptability, robustness, homeostasis, memory, and immunity. Inspired by the human immune system, some new modes, algorithms, technologies and theoretical understandings were presented in the literation [6,7,8]. Feature selection is one of the most important tasks in many engineering problems. Importantly, AIS has a better performance in feature selection, which has the advantages of global optimization, parallel computing, noise tolerance, etc. It is widely applied in scientific computation and solving engineering problems of the existing literatures [8,9,10]. The feature selection algorithms based on immune-inspired have the advantage of parallel computing in multi-dimensional space.

    Computer-aided lung cancer diagnosis mainly includes lung parenchyma extraction, the region of interest segmentation, the feature extraction and classification. In the image processing, feature selection and classification are important issues for the early detection of lung cancers [11,12,13,14]. In the past few years, many different models and approaches were designed for feature selection and classification, each with its own strengths and weaknesses. For instance, the method of kernel optimization was used for feature selection in the high-dimensional data, experimental results demonstrated the efficiency and accuracy [15]. The fuzzy clustering and hybrid intelligent methods were used for the diagnosis of pulmonary nodules [16]. In addition, Froz et al. [4] used an expert system for lung nodule classification, obtained an accuracy of 94.3% and a specificity of 94.78%. Lu et al. [5] used the genetic algorithm for feature selection, and support vector machine (SVM) for pulmonary nodule classification, obtained an accuracy of 99%. Similarly, Deep learning methods also have good performance in feature selection and classification. For example, the deep reinforcement learning method was adopted for lung nodule detection and obtained the accuracy of 64.4% [17]. In order to improve the accuracy of classification, the deep learning and swarm intelligence of computational intelligence techniques were used to develop types of nodule detection and classification, which achieved up to 93.71% accuracy [18]. In other respects, a novel convolutional neural networks with carefully designed loss function regularization was proposed for thyroid nodule detection in the ultrasound images, the proposed model can detect various types of thyroid nodules [19].

    From the analysis of literature, it was observed that most of the studies thought that feature selection and classification were the independent processes, whereupon the feature selection and classification were done separately. Although the process is the local optimization, the final results are not global optimum. Actually, the processes are closely related and mutually influential, which could be performed simultaneously. In this regard, many methods have been proposed. For example, the genetic algorithm was used for feature selection and the SVM parameter optimization [20]. But genetic algorithm has an issue of the degradation phenomenon in the solving process of global optimal, which is easily premature convergence. Here, an immune algorithm can solve the problem by produces excellent antibodies through the memory function. However, immune algorithm also has some shortcomings that the initial population is insufficient diversity, and the variation scale is big or small. To solving these issues, tremendous research efforts have done in the past few years. For instance, Jordehi [21] introduced the chaotic strategies into the immune algorithm and designed to solve the global continuous optimization problems. Wu et al. [22] used the chaotic producers to replace the random generators and used the parallel mutation strategy for the hypermutation of antibodies. Liang et al. [23] used the logistic mapping to generate the initial antibody populations for the diversity of antibodies, etc. While most of the approaches thought that the value of a control factor of the chaotic motion is randomly. Recently, we found that it takes an appropriate value of the control factor will generate the exciting results.

    In this work, we focused on improving the performance of immune clonal optimization algorithm for pulmonary nodule classification in chest CT images. Therefore, the method of intelligent immune clonal selection is proposed and developed by the population diversity and intelligent mutation. First, according to the mechanism and characteristics of chaotic motion, the proposed method utilizes the characteristics of chaotic motion and selects the control factor of optimal chaotic state, to generate the initial population with randomness and ergodicity. Second, the proposed method solved the singleness problem of the initial population of immune algorithm. Then based on the characteristics of the Gaussian mutation operator with a small scale, and Cauchy mutation operator with a big scale, the intelligent mutation strategy is developed, and the control factor of intelligent mutation is designed. Especially, the intelligent Gauss-Cauchy hybrid mutation operator is designed and implemented. Additionally, the Hessian information and multi-scale reverse LoG (Hessian-MRLoG) method were used to detect and locate the lesion area in the image preprocessing, and the candidate nodule areas are extracted by an algorithm of the lung nodule segmentation. Therefore, the computational amount of feature extraction and classification is greatly reduced, the data quality is improved and the computational complexity is reduced.

    The remainder of this paper is organized as follows. Section 2 describes the method and approach of the proposed intelligent immune clonal for feature selection and parameter optimization. Section 3, the experimental results obtained by the proposed method. Lastly, Section 4 presents the final remarks of this work.

    The chaotic mapping was adopted for antibody population initialization in the immune clonal selection algorithm. According to the theoretical study of population models, the logistic mapping model is described as follows:

    xn+1=μxn(1xn),n=1,2,... (1)

    where μ is a regulation factor of the chaotic behavioral and μ > 0, x(n) is the chaotic variable and n is the number of iterations. Let the initial condition x0∈[0, 1], μ∈[0, 4], then the diagram of chaotic motion bifurcation is shown in Figure 1.

    Figure 1.  The bifurcation diagram of the logistic model.

    From Figure 1, if 0 < μ < 3, then the chaotic motion shows single-cycle sequence converges. If 3 < μ < 3.449, then the chaotic motion converges with a period of 2 times. If 3.449 < μ < 3.57, then the chaotic motion converges with a 4-fold period. If 3.57 < μ < 4, the chaotic motion comes into a chaotic state. If μ = 4, then the system enters a completely chaotic state. Based on the analysis, it is shown that a system evolves into chaotic state when the regulation factor 3.57 < μ ≤ 4. Therefore, let the initial conditions of x0∈[0, 1], μ∈[0, 4], and μ = 4, n = 200, then the random sequence is generated by the logistic model, which is shown in Figure 2.

    Figure 2.  The ergodicity of the logistic mapping.

    Figure 2 shows the ergodicity of the logistic mapping with a suitable regulation factor, which was generated an initial antibody population with diversity. Thus, we utilize the characteristics of chaotic motion and select the regulation factor of optimal chaotic state, to generate an initial antibody population with randomness and ergodicity. It solved the singleness problem of the initial population generation of the immune clonal selection algorithm.

    According to the literature [24], the combination of Gauss-Cauchy is written as:

    fgc=k(fgaussian(μ,σ)+fcauchy(x0,γ)) (2)

    where k is an adjustment factor. fgaussian (μ, σ) is a Gaussian distribution function, where σ is the standard deviation and σ > 0, μ is the mathematical expectation. fcauchy (x0, γ) is a Cauchy distribution function, where x0 is the length setting of x-axis, γ is a window scale, and x∈(-∞, +∞). Let k = 1/2, μ = 0, σ = 1; x0 = 0, γ = 1, respectively, then the average distribution curve of Gauss-Cauchy combination is shown in Figure 3.

    Figure 3.  The average distribution curve of the Gauss-Cauchy combination.

    Therefore, the hybrid Gauss-Cauchy mutation operator is described as follows:

    x=x+k(fgaussian(μ,σ)+fcauchy(x0,γ)) (3)

    More importantly, we designed an intelligent strategy for the mutation, which is described below: In early phase, the mutation rate has the characteristics of large-scale variation; in the intermediate phase, the mutation rate has a middle-scale variation; in the late stage, the mutation rate has a small-scale variation. Thus, the mutation rate reduced gradually with the increase of evolutionary iterations. The mutation rate is designed by:

    v=1(t/tmax) (4)

    where t is the current evolutionary iterations, tmax is the maximum iterations. The mutation rate of ν is a monotonically decreasing function, and ν∈[0, 1]. Consequently, the intelligent mutation operator (IMO) is given as:

    x=x+vk(fgaussian(μ,σ)+fcauchy(x0,γ)) (5)

    where k is an adjustment factor, fgaussian (μ, σ) is a Gaussian distribution function, fcauchy (x0, γ) is a Cauchy distribution function. With the guidance of intelligent mutation strategy, the IMO not only has the local convergence characteristics of Gaussian mutation, but also the global convergence characteristics of Cauchy mutation.

    The compound features were extracted in this work from three aspects, which include the intensity features, shape features, and texture features [25]. The character representation is listed in Table 1.Though the feature combination, 26 feature vectors were extracted for feature selection. Then the 26-dimensional feature space was constructed.

    Table 1.  Feature description.
    Classification Feature name Number Formulas
    I.The intensity features (f1-f5) Mean value f1 Li=1[iI(i)]
    Variance f2 Li=1[(iμ)2I(i)]
    Deviation f3 (1/σ2)Li=1[(iμ)3I(i)]
    Energy f4 Li=1I2(i)
    Gray-scale entropy f5 Li=1[I(i)log2I(i)]
    II. The shape features (f6-f10) Circularity f6 P02/S0
    Compactness f7 4π·Area/(Circumference)2
    Elongation f8 rmax / rmin
    Convex degree f9 Area / Convex_Area
    Rectangularity f10 Wr/ Lr
    Compactness f11 rinside / routside
    Moment f12 Mi=1Nj=1[I2(i,j)/(1+|ij|)]
    III.Text features (f1-f14) Angular second moment f13 Ngi=1Ngi=1[p2(i,j)]
    Contrast f14 Ngn=1n2{Ngi=1Ngj=1p(i,j)}
    Correlation f15 (σxσy)1Ngi=1Ngi=1{(ij)p(i,j)μxμy}
    Variance f16 Ngi=1Ngj=1{(iμx)(iμy)p(i,j)}
    Inverse difference moment f17 Ngi=1Ngj=1[1/(1+(ij)2)p(i,j)]
    Sum average f18 2Ngi=2(ipx+y(i))
    Sum variance f19 2Ngi=2[(if8)2px+y(i)]
    Sum entropy f20 2Ngi=2[px+y(i)log(px+y(i))]
    Entropy f21 Ngi=1Ngj=1{p(i,j)log(p(i,j))}
    Difference variance f22 Var(Ngi=1Ngj=1p(i,j))
    Difference entropy f23 Ngn=1{pxy(i)log(pxy(i))}
    Information measures of correlation 1 f24 {Ngi=1Nsj=1p(i,j)log(p(i,j))+Nsi=1Nsj=1p(i,j)log(Nsj=1p(i,j)Nsi=1p(i,j))}{max(Nsi=1p(i,j),Ngi=1p(i,j))}1
    Information measures of correlation 2 f25 {1exp(2Ngi=1Ngi=1(px(i)py(j)log(px(i)py(j))+f9)}1/2
    Maximal correlation coefficient f26  (Second largest eigenvalue of Q)1/2, where Q(i,j)=k{p(i,k)p(j,k)/(px(i)py(j))}

     | Show Table
    DownLoad: CSV

    (1) Radial basis kernel function (RBF)

    According to the study of lung nodule model in our previous study [26], we know that the intensity distribution of a lung nodule in the CT image, gradually increases from the periphery to the center. Therefore, this paper adopts RBF as the kernel function, which has a better performance on nonlinear mapping. A 2-dimensional (2D) Gaussian kernel function is represented by:

    K(x,xi)=exp(xxi22σ2) (6)

    Let γ = 1/(2σ2), then the Eq (6) is instead by a RBF

    K(x,xi)=exp(γxxi2) (7)

    (2) Chromosome coding method

    The antibody coding is the combining process of feature subsets and SVM parameters. An antibody is a result of the combination of feature subsets and SVM parameters. The synchronous optimization of the feature selection and SVM parameter is to find the optimal antibodies. It is noteworthy that an antibody has multiple gene positions, the number of gene positions is equal to the length of the coding string. In the chromosome coding, 1 is a dominant gene, which indicates the selected feature; 0 is a recessive gene, which indicates the unselected feature. In this work, the chromosome coding includes the penalty factor of C, the kernel parameter of r, and the feature set of f. While, the corresponding number of genes are represented by I, J and K, respectively. L denotes the string length, the encoded string is given by (xc1, xc2, …, xci, …, xcI; xγ1, xγ2, …, xγj, …, xγJ; xf1, xf2, …, xfk, …, xfK), and I + J + K = L. The chromosome encoding is shown in Figure 4.

    Figure 4.  Chromosome coding.

    From Figure 4, it should be noted that the first eight bits of (xc1, xc2, …, xci, …, xcI) are the coding of the penalty factor C, the following eight bits of (xγ1, xγ2, …, xγj, …, xγJ) are the coding of the kernel parameter γ, the last 26 bits of (xf1, xf2, …, xfk, …, xfK) are the coding of the feature set f.

    (3) The affinity function

     Affinity = SVM_Accuracy  (8)

    (4) Antibody selection and cloning

    If the population size is N, the probability of antibody fitness is:

    pfit=FitnessNi=1Fitness(i) (9)

    The number of clone is:

    Nc(i)=Round(pfitN/i) (10)

    Additionally, the intelligent mutation was implemented by the proposed intelligent mutation operator. From the above analysis, the method of immune clonal optimization algorithm (ICOA) for parameter optimization in support vector machine (ICOA-SVM) was proposed and developed. The flowchart of the developed method is shown in Figure 5.

    Figure 5.  Flowchart of the ICOA-SVM.

    It should be noted that the implementation procedure can be obtained from the Figure 5. The input is the candidate feature set of F: F = {(xi, yi)∣xFnL}, y∈(1, 2, …, L), i = 1, …, n}. The output are the optimized parameters of (C, γ), and the selected feature subsets of f = {f1, f2, …, fk}, 1 ≤ k ≤ K. The implementation of the proposed algorithm includes the training and the testing stages. In the process of immune clonal optimization, the antibodies with high affinity were selected and cloned, and generated the offspring of Ab′(l) = {Ab(l), Ab1(l), …, Abi(l), …, Abn(l)}, Abi(l) = (xi1, xi2, xi(Nc-1)), Nc(i) = Round(pfit·N/i). Additionally, the others new individuals were generated by the low affinity antibodies though the mutation of Ab′ = Ab + 0.5·(1 - t/tmax) (fgaussian(0, 1) + fcauchy(0, 1)).

    The proposed method was implemented in Matlab (R2015a; MathWorks, Inc., Natick, MA, USA) on a machine with Core i3 M 350 @ 2.27GHz, 6G RAM, 64-bit operating system, the CT images that we adopted were obtained from the international public LIDC database [27]. The 90 scans of CT images with the number of 652 lung nodules were randomly selected from the LIDC database. One scan has the number of 100~400 CT images. Thus, more than 20, 000 images were used in the experiments of this study.

    There are several steps for the image processing. First, medical images are usually noisy, especially the low-dose spiral CT images, which mainly have the quantum noise, electronic noise, Gaussian noise, and speckle noise [28]. Therefore, a hybrid filtering strategy was developed, which used the median filter for the isolated noise reduction, and the multi-scale windowed Fourier filtering for the noise reduction [29]. Second, the multi-scale elliptic rolling-ball method, which was proposed and developed in our previous work [30], was used for pulmonary parenchyma extraction. Third, the Hessian-MRLoG method [2] was used for the detection and location of pulmonary nodule candidates, then the candidate nodule areas were extracted by an algorithm of the lung nodule segmentation [29]. Four, the features of the regions of interest (ROI) were extracted to construct the 42-dimensional coding space. Especially, there are 26-dimensional feature vectors in feature subsets, which include 5 gray features, 7 shape features, and 14 texture features. The experimental dataset was used to verify the effectiveness of the proposed method, and the experimental adjustment parameters of the proposed method were shown in Table 2.

    Table 2.  Parameter settings of the ICOA-SVM.
    Parameters Name of the parameters Value
    N Population size 60
    tmax Maximum iterations 100
    L The length of chromosome genes 34
    m The number of elites 5
    Nc_max Maximum number of clones 120
    w Mutation probability 0.5
    ξi The relaxation variable 0.5
    k The number of k-fold cross 5

     | Show Table
    DownLoad: CSV

    Especially, the penalty factor of C∈[0.001, 2.2 × 104], the kernel parameter of γ∈[1 × 10-5, 5]. The initial population was generated by the chaotic mapping. The feature subsets and SVM parameters were selected and optimized by the developed method of ICO. The optimal feature subsets and the best matching parameters were obtained by five times of 5-fold cross-validation. According to the number of selected feature dimensions, the optimal feature subsets and best match parameters are shown in Table 3.

    Table 3.  Optimal feature subsets and best match parameters.
    Serial numbers Parameters of SVM Selected subsets of features Accuracy
    1 C = 3985, γ = 0.0432 f6, f7, f3, f25, f9 0.920
    2 C = 6591, γ = 0.0332 f6, f7, f4, f11, f9, f26, f14 0.965
    3 C = 6730, γ = 0.0436 f6, f7, f4, f5, f8, f9, f26, f14 0.978
    4 C = 8390, γ = 0.0529 f6, f7, f3, f5, f9, f26, f25, f2, f17, f14 0.940
    5 C = 8534, γ = 0.0623 f6, f7, f3, f4, f5, f8, f9, f26, f25, f14, f11, f1, f18 0.935

     | Show Table
    DownLoad: CSV

    In this experiment, the eight features of f6, f7, f4, f5, f8, f9, f26, and f14 constructing an optimal feature subset, and the corresponding parameters of SVM are C = 6730, γ = 0.0436, which obtained the highest accuracy of 97.8%. The experimental results revealed that the accuracy rate is not proportional to the dimensions of selected features. After ten times of 5-fold cross-validation, the relationship between the number of selected features and the classification accuracy is shown in Figure 6.

    Figure 6.  The relation curve.

    From Figure 6, the results indicate that it is not the more selected feature dimensions, the higher of the classification accuracy. Actually, it has a better match between the classification accuracy and the number of selected features. In this work, when the dimensions of selected features are 8, the accuracy of classification is the highest.

    The combination of the two-classification problems can be represented by a two-class confusion matrix [26]. If the practical situation is a nodule, and the prediction result is also a nodule, then it belongs to the nodule, this situation is the true positive (TP). If the practical situation is a non-nodule, and the prediction result is a nodule, then it belongs to the non-nodule, this situation is the false positive (FP). If the practical situation is a nodule, and the prediction result is a non-nodule, then it belongs to the situation of false negative (FN). If the practical situation is a non-nodule, and the prediction result is a non-nodule, then it belongs to the situation of true negative (TN). Generally speaking, the performance of classification algorithms are commonly evaluated by the receiver operating characteristic (ROC) curve and its area under the curve (AUC) value. The evaluation indicators are shown in Table 4.

    Table 4.  Evaluation indicators and the corresponding meanings.
    Indexes Implication Formulas
    Sensitivity True positive rate Sens = TP / (TP + FN)
    Specificity True negative rate Spec = TN / (TN + FP)
    Accuracy Accuracy Acc = (TP + TN) / (TP + TN + FP + FN)
    FPR False positive rate FPR = TP / (TP + FP)
    AUC The area under the characteristic curve AUC = ∑i{(1-3y(i) / 2-y(i-1) / 2) * (x(i)-x(i-1))}

     | Show Table
    DownLoad: CSV

    The bigger of the value of sensitivity, specificity, accuracy, and AUC, the better the performance of a classifier. Additionally, the lower the value of false positive rate (FPR), the better the performance of a classifier. In this work, at the same experimental conditions, we analyzed the properties of genetic algorithm (GA), immune clone selective algorithm (ICSA), and the proposed ICOA-SVM, respectively. Additionally, the approach of convolutional neural network (CNN) was used for the pulmonary nodule classification at the same experimental dataset. Especially, the initial learning rate is 0.001. After that, each generation of learning rate decays to 0.9 of the previous generation learning rate. After experiments, the experimental results were shown in Table 5.

    Table 5.  The comparison among different approaches.
    Methods Accuracy (%) Sensitivity (%) AUC (%)
    GA-SVM 91.37 92.25 94.92
    ICSA-SVM 93.25 94.75 96.54
    CNN 93.13 92.50 96.64
    The proposed 97.87 98.75 99.09

     | Show Table
    DownLoad: CSV

    From Table 5, the proposed method has the best performance among the four approaches. Therefore, the proposed method is superior to the mentioned approaches. Besides, to more clearly represent the classification performance, the ROC was plotted in Figure 7.

    Figure 7.  Comparison of the area under the curve of ROC.

    As can be seen from Figure 7, the area under the cure value of GA-SVM, ICSA-SVM, CNN, and the proposed are 94.92, 96.54, 96.64, and 99.09%, respectively. The proposed method has the biggest AUC value among the four approaches. According to its analysis, the proposed method is superior to the mentioned methods in the performance of lung nodule classification. Additionally, the performance of the proposed method was also compared with other related approaches, which reported in the literature in the last five years, as summarized in Table 6.

    Table 6.  Comparison of the results obtained in this study with those of other works.
    Literatures Scans Nodules Accuracy (%) Sensitivity (%) FPs/scan Dataset
    Lü et al. [3] 50 435 84.6 88.9 -- Private
    Liu et al. [31] 80 978 93.2 92.4 4.5 LIDC
    Liu et al. [32] 107 252 -- 89.4 2.0 LIDC
    Kaya [33] -- 439 84.7 67.4 -- LIDC
    Khan et al. [34] 84 103 97.5 98.9 -- LIDC
    Zhang et al. [35] -- 353 88.6 86.3 -- LIDC
    Jiang et al. [36] -- -- 90.24 92.04 11.06 LIDC
    Santos et al. [14] 28 252 -- 90.6 1.17 LIDC
    The proposed 90 652 97.87 98.75 1.52 LIDC

     | Show Table
    DownLoad: CSV

    The comparison reveals that the proposed method in this work has the highest sensitivity, while the approach reported in another study has the lowest FPR [14]. But, regarding the overall performance, the proposed method in this study has the best performance. Thus, the proposed method is superior to the traditional approaches. Additionally, the Hessian-MRLoG method was used to detect and locate the lesion area, then the candidate nodule areas were extracted by an algorithm of the lung nodule segmentation. As a result, the computational amount of feature extraction and classification is greatly reduced, the data quality is improved and the computational complexity is reduced. Therefore, this study gives a reference for a novel light-weight method of computer-aided diagnosis of pulmonary nodules.

    Computer-aided pulmonary nodule diagnosis is an effective approach for the early detection of lung cancers. In this paper, an intelligent immune clonal optimization algorithm is proposed and developed to find the optimal feature subset and best match parameters of the classification algorithm, which makes the feature selection and classification be performed simultaneously. The initial population was generated by the randomness and ergodicity of the chaotic mapping. The singleness problem of the initial population of the immune clonal selection algorithm was solved in this work. The proposed method has the global and local convergence characteristics by the intelligent mutation operator. Therefore, a novel method based on an intelligent immune clonal optimization algorithm for pulmonary nodule classification was proposed and developed in this work. The proposed approaches of this work were used for the classification of 90 scans with 652 nodules, and the results revealed an accuracy of 97.87% with the 1.52 FPs/scan. Consequently, the experimental results demonstrated that the proposed approach can improve the accuracy and reduce the false positive rate in pulmonary nodule classification. This work has the potential for deployment in the early computer-aided diagnosis of lung cancer.

    Supported by the grants from the National Natural Science Foundation of China (Grant No. 61673257, 61705127), the construction project of applied undergraduate of Shanghai for the major of electrical engineering and automation of SUES, and the doctoral training foundation of SUES (Grant No. 0232-A3-0100-21-0931).

    No potential conflicts of interest were disclosed.



    [1] W. L. Bi, A. Hosny, M. B. Schabath, M. L. Giger, N. J. Birkbak, A. Mehrtash, Artificial intelligence in cancer imaging: clinical challenges and applications, CA: Cancer J. Clin., 69 (2019), 127-150.
    [2] Q. Mao, S. Zhao, D. Tong, S. Su, Z. Li, X. Cheng, Hessian-MRLoG: Hessian information and multi-scale reverse LoG filter for pulmonary nodule detection, Comput. Biol. Med., 131 (2021), 104272. doi: 10.1016/j.compbiomed.2021.104272
    [3] X. Q. Lü, L. Wu, Y. Gu, W. L. Zhang, J. Li, Detection of low dose CT pulmonary nodules based on 3D convolution neural network, Opt. Precis. Eng., 26 (2018), 1211-1218. doi: 10.3788/OPE.20182605.1211
    [4] B. R. Froz, A. O. de C. Filho, A.C. Silva, A. C. de Paiva, R. A. Nunes, M. Gattass, Lung nodule classification using artificial crawlers, directional texture and support vector machine, Expert Syst. Appl., 69 (2017), 176-188.
    [5] C. Lu, Z. Zhu, X. Gu, An intelligent system for lung cancer diagnosis using a new genetic algorithm based feature selection method, J. Med. Syst., 38 (2014), 97. doi: 10.1007/s10916-014-0097-y
    [6] A. Abid, M. T. Khan, M. S. Khan, Multidomain features-based GA optimized artificial immune system for bearing fault detection, IEEE Trans. Syst., Man Cybern. Syst., 50 (2020), 348-359. doi: 10.1109/TSMC.2017.2746762
    [7] J. Gao, Q. Jiang, B. Zhou, D. Chen, Convolutional neural networks for computer-aided detection or diagnosis in medical image analysis: An overview, Math. Biosci. Eng., 16 (2019): 6536-6561. doi: 10.3934/mbe.2019326
    [8] Z. Wang, X. Tang, H. Liu, L. Peng, Artificial immune intelligence-inspired dynamic real-time computer forensics model, Math. Biosci. Eng., 17 (2020), 7221-7233.
    [9] T. Gong, T. Fan, L. Pei, Z. Cai, Magnetic resonance imaging-clonal selection algorithm: An intelligent adaptive enhancement of brain image with an improved immune algorithm, Eng. Appl. Artif. Intell., 62 (2017), 405-411. doi: 10.1016/j.engappai.2016.10.004
    [10] D. González-Patiñ o, Y. Villuendas-Rey, A. J. Argüelles-Cruz, O. Camacho-Nieto, C. Yáñ ez-Márquez, AISAC: an artificial immune system for associative classification applied to breast cancer detection, Appl. Sci., 10 (2020), 1-22.
    [11] L. Liu, Q. Dou, H. Chen, J. Qin, P. A. Heng, Multi-task deep model with margin ranking loss for lung nodule analysis, IEEE Trans. Med. Imaging, 39 (2020), 718-728. doi: 10.1109/TMI.2019.2934577
    [12] Z. Zhou, S. Li, G. Qin, M. Folkert, S. Jiang, J. Wang, Multi-objective-based radiomic feature selection for lesion malignancy classification, IEEE J. Biomed. Health Inf., 24 (2020), 194-204. doi: 10.1109/JBHI.2019.2902298
    [13] J. Z. Farkas, G. T. Smith, G. F. Webb, A dynamic model of CT scans for quantifying doubling time of ground glass opacities using histogram analysis, Math. Biosci. Eng., 15 (2018), 1203-1224. doi: 10.3934/mbe.2018055
    [14] A. M. Santos, A. Filho, A. C. D. Paiva, R. A. Nunesb, M. Gattassc, Automatic detection of small lung nodules in 3D CT data using Gaussian mixture models, tsallis entropy and SVM, Eng. Appl. Artif. Intell., 36 (2014), 27-39.
    [15] N. Bi, J. Tan, J. Lai, J. H. Lai, C. Y. Suen, High-dimensional supervised feature selection via optimized kernel mutual information, Expert Syst. Appl., 108 (2018), 81-95. doi: 10.1016/j.eswa.2018.04.037
    [16] F. V. Farahani, A. Ahmadi, M. H. F. Zarandi, Hybrid intelligent approach for diagnosis of the lung nodule from CT images using spatial kernelized fuzzy c-means and ensemble learning, Math. Comput. Simul., 149 (2018), 48-68. doi: 10.1016/j.matcom.2018.02.001
    [17] I. Ali, G. R. Hart, G. Gunabushanam, Y. Liang, W. Muhammad, B. Nartowt, et al., Lung nodule detection via deep reinforcement learning, Front. Oncol., 8 (2018), 108.
    [18] C. A. de P. Pinheiro, N. Nedjah, L. de M. Mourelle, Detection and classification of pulmonary nodules using deep learning and swarm intelligence, Multimedia Tools Appl., 79 (2020), 15437-15465. doi: 10.1007/s11042-019-7473-z
    [19] F. Abdolali, J. Kapur, J. L. Jaremko, M. Noga, A. R. Hareendranathan, K. Punithakumar, Automated thyroid nodule detection from ultrasound imaging using deep convolutional neural networks, Comput. Biol. Med., 122 (2020), 103871. doi: 10.1016/j.compbiomed.2020.103871
    [20] M. R. G. Raman, N. Somua, K. Kirthivasan, R. Liscano, V. S. S. Sriram, An efficient intrusion detection system based on hypergraph-Genetic algorithm for parameter optimization and feature selection in support vector machine, Knowl.-Based Syst., 134 (2017), 1-12. doi: 10.1016/j.knosys.2017.07.005
    [21] A. R. Jordehi, A chaotic artificial immune system optimisation algorithm for solving global continuous optimisation problems, Neural Comput. Appl., 26 (2015), 827-833. doi: 10.1007/s00521-014-1751-5
    [22] S. Wu, H. D. Wan, S. K. Shukla, B. Li, Chaos-based improved immune algorithm (CBⅡA) for resource-constrained project scheduling problems, Expert Syst. Appl., 38 (2011), 3387-3395. doi: 10.1016/j.eswa.2010.08.124
    [23] H. Liang, F. Kang, Adaptive chaos parallel clonal selection algorithm for objective optimization in WTA application, Optik, 127 (2016), 3459-3465. doi: 10.1016/j.ijleo.2015.12.122
    [24] K. Chellapilla, Combining mutation operators in evolutionary programming, IEEE Trans. Evol. Comput., 2 (1998), 91-96. doi: 10.1109/4235.735431
    [25] Z. Wang, J. Xin, P. Sun, Z. Lin, Y. Yao, X. Gao, Improved lung nodule diagnosis accuracy using lung CT images with uncertain class, Comput. Methods Programs Biomed., 162 (2018), 197-209. doi: 10.1016/j.cmpb.2018.05.028
    [26] Q. Mao, S. Zhao, Q. Zheng, S. Su, L. Li, X. Zhang, Modified Gaussian models for pulmonary nodule simulation in chest tomosynthesis, J. Med. Imaging Health Inf., 8 (2018), 1718-1725. doi: 10.1166/jmihi.2018.2498
    [27] S. G. Armato Ⅲ, G. McLennan, L. Bidaut, M. F. McNitt-Gray, C. R. Meyer, A. P. Reeves, et al., Data from LIDC-IDRI, The cancer imaging archive, 2015. Available from: http://doi.org/10.7937/K9/TCIA.2015.LO9QL9SX.
    [28] M. Hilts, C. Duzenli, Image filtering for improved dose resolution in CT polymer gel dosimetry, Med. Phys., 31 (2004), 39-49.
    [29] Q. Mao, S. Zhao, T. Gong, Q. Zheng, An effective hybrid windowed Fourier filtering and fuzzy c-mean for pulmonary nodule segmentation, J. Med. Imaging Health Inf., 8 (2018), 72-77. doi: 10.1166/jmihi.2018.2235
    [30] Q. Mao, S. Zhao, Modified rolling-ball method for pulmonary parenchyma segmentation, J. Med. Imaging Health Inf., 10 (2020), 364-369. doi: 10.1166/jmihi.2020.2882
    [31] J. k. Liu, H. Jiang, M. Gao, C. He, Y. Wang, P. Wang, et al., An assisted diagnosis system for detection of early pulmonary nodule in computed tomography images, J. Med. Syst., 41 (2017), 30.
    [32] X. L. Liu, F. Hou, H. Qin, A. Hao, A CADe system for nodule detection in thoracic CT images based on artificial neural network, Sci. China Inf. Sci., 60 (2017), 072106. doi: 10.1007/s11432-016-9008-0
    [33] A. Kaya, Cascaded classifiers and stacking methods for classification of pulmonary nodule characteristics, Comput. Methods Prog. Biomed., 166 (2018), 77-89. doi: 10.1016/j.cmpb.2018.10.009
    [34] S. A. Khan, S. Hussain, S. Yang, K. Iqbal, Effective and reliable framework for lung nodules detection from CT scan images, Sci. Rep., 9 (2019), 1-14.
    [35] G. Zhang, X. Liu, D. Zhu, P. He, L. Liang, Y. Luo, et al., 3D spatial pyramid dilated network for pulmonary nodule classification, Symmetry-Basel, 10 (2018), 376.
    [36] H. Jiang, F. Gao, X. Xu, F. Huang, S. Zhu, Attentive and ensemble 3D dual path networks for pulmonary nodules classification, Neurocomputing, 398 (2020), 422-430. doi: 10.1016/j.neucom.2019.03.103
  • This article has been cited by:

    1. Yudong Zhang, Juan Manuel Gorriz, Deepak Ranjan Nayak, Optimization Algorithms and Machine Learning Techniques in Medical Image Analysis, 2023, 20, 1551-0018, 5917, 10.3934/mbe.2023255
    2. Wei Zhang, Lili An, Xiaojun Li, Feng Chen, Lidong Sun, Xunming Wang, Jie Cai, Adjustment method and energy consumption of centrifugal pump based on intelligent optimization algorithm, 2022, 8, 23524847, 12272, 10.1016/j.egyr.2022.09.031
    3. Iftikhar Naseer, Sheeraz Akram, Tehreem Masood, Arfan Jaffar, Muhammad Adnan Khan, Amir Mosavi, Performance Analysis of State-of-the-Art CNN Architectures for LUNA16, 2022, 22, 1424-8220, 4426, 10.3390/s22124426
    4. XiaoQing Zhang, GuangYu Wang, Shu-Guang Zhao, CapsNet-COVID19: Lung CT image classification method based on CapsNet model, 2022, 19, 1551-0018, 5055, 10.3934/mbe.2022236
    5. Hafiz Muhammad Tayyab Khushi, Tehreem Masood, Arfan Jaffar, Sheeraz Akram, A Novel Approach to Classify Brain Tumor with an Effective Transfer Learning based Deep Learning Model, 2024, 67, 1678-4324, 10.1590/1678-4324-2024231137
    6. Hafiz Muhammad Tayyab Khushi, Tehreem Masood, Arfan Jaffar, Sheeraz Akram, Sohail Masood Bhatti, Performance analysis ofstate‐of‐the‐art CNNarchitectures for brain tumour detection, 2024, 34, 0899-9457, 10.1002/ima.22949
    7. Jintao Wang, Mao Qi, Zhenwu Xiang, Yi Tian, Dongbing Tong, SaraNet: Semantic aggregation reverse attention network for pulmonary nodule segmentation, 2024, 177, 00104825, 108674, 10.1016/j.compbiomed.2024.108674
    8. Wenfeng Wang, Qi Mao, Yi Tian, Yan Zhang, Zhenwu Xiang, Lijia Ren, FMD-UNet: fine-grained feature squeeze and multiscale cascade dilated semantic aggregation dual-decoder UNet for COVID-19 lung infection segmentation from CT images, 2024, 10, 2057-1976, 055031, 10.1088/2057-1976/ad6f12
    9. Tolga Hayit, Ali Endes, Fatma Hayit, The severity level classification of Fusarium wilt of chickpea by pre-trained deep learning models, 2023, 2239-7264, 10.1007/s42161-023-01520-z
    10. Hafiz Muhammad Tayyab Khushi, Tehreem Masood, Arfan Jaffar, Muhammad Rashid, Sheeraz Akram, Improved Multiclass Brain Tumor Detection via Customized Pretrained EfficientNetB7 Model, 2023, 11, 2169-3536, 117210, 10.1109/ACCESS.2023.3325883
    11. Zhenwu Xiang, Qi Mao, Jintao Wang, Yi Tian, Yan Zhang, Wenfeng Wang, Dmbg-Net: Dilated multiresidual boundary guidance network for COVID-19 infection segmentation, 2023, 20, 1551-0018, 20135, 10.3934/mbe.2023892
    12. 一展 刘, Classification of Pulmonary Nodules Based on 3D CMFF Deep Network Model, 2024, 13, 2324-8696, 588, 10.12677/MOS.2024.131057
    13. Yan Zhang, Qi Mao, Yi Tian, Wenfeng Wang, Lijia Ren, Haibo Li, Dual-path information enhanced pyramid Unet for COVID-19 lung infection segmentation, 2025, 142, 09521976, 109977, 10.1016/j.engappai.2024.109977
    14. Yi Tian, Qi Mao, Wenfeng Wang, Yan Zhang, Hierarchical agent transformer network for COVID-19 infection segmentation, 2025, 11, 2057-1976, 025055, 10.1088/2057-1976/adbafa
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2981) PDF downloads(83) Cited by(14)

Figures and Tables

Figures(7)  /  Tables(6)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog