Research article Special Issues

Lung radiomics features for characterizing and classifying COPD stage based on feature combination strategy and multi-layer perceptron classifier


  • Computed tomography (CT) has been the most effective modality for characterizing and quantifying chronic obstructive pulmonary disease (COPD). Radiomics features extracted from the region of interest in chest CT images have been widely used for lung diseases, but they have not yet been extensively investigated for COPD. Therefore, it is necessary to understand COPD from the lung radiomics features and apply them for COPD diagnostic applications, such as COPD stage classification. Lung radiomics features are used for characterizing and classifying the COPD stage in this paper. First, 19 lung radiomics features are selected from 1316 lung radiomics features per subject by using Lasso. Second, the best performance classifier (multi-layer perceptron classifier, MLP classifier) is determined. Third, two lung radiomics combination features, Radiomics-FIRST and Radiomics-ALL, are constructed based on 19 selected lung radiomics features by using the proposed lung radiomics combination strategy for characterizing the COPD stage. Lastly, the 19 selected lung radiomics features with Radiomics-FIRST/Radiomics-ALL are used to classify the COPD stage based on the best performance classifier. The results show that the classification ability of lung radiomics features based on machine learning (ML) methods is better than that of the chest high-resolution CT (HRCT) images based on classic convolutional neural networks (CNNs). In addition, the classifier performance of the 19 lung radiomics features selected by Lasso is better than that of the 1316 lung radiomics features. The accuracy, precision, recall, F1-score and AUC of the MLP classifier with the 19 selected lung radiomics features and Radiomics-ALL were 0.83, 0.83, 0.83, 0.82 and 0.95, respectively. It is concluded that, for the chest HRCT images, compared to the classic CNN, the ML methods based on lung radiomics features are more suitable and interpretable for COPD classification. In addition, the proposed lung radiomics combination strategy for characterizing the COPD stage effectively improves the classifier performance by 12% overall (accuracy: 3%, precision: 3%, recall: 3%, F1-score: 2% and AUC: 1%).

    Citation: Yingjian Yang, Wei Li, Yingwei Guo, Nanrong Zeng, Shicong Wang, Ziran Chen, Yang Liu, Huai Chen, Wenxin Duan, Xian Li, Wei Zhao, Rongchang Chen, Yan Kang. Lung radiomics features for characterizing and classifying COPD stage based on feature combination strategy and multi-layer perceptron classifier[J]. Mathematical Biosciences and Engineering, 2022, 19(8): 7826-7855. doi: 10.3934/mbe.2022366

    Related Papers:

    [1] Warin Rangubpit, Sunan Kitjaruwankul, Panisak Boonamnaj, Pornthep Sompornpisut, R.B. Pandey . Globular bundles and entangled network of proteins (CorA) by a coarse-grained Monte Carlo simulation. AIMS Biophysics, 2019, 6(2): 68-82. doi: 10.3934/biophy.2019.2.68
    [2] Ken Takahashi, Takayuki Oda, Keiji Naruse . Coarse-grained molecular dynamics simulations of biomolecules. AIMS Biophysics, 2014, 1(1): 1-15. doi: 10.3934/biophy.2014.1.1
    [3] Tomohiro Yanao, Sosuke Sano, Kenichi Yoshikawa . Chiral selection in wrapping, crossover, and braiding of DNA mediated by asymmetric bend-writhe elasticity. AIMS Biophysics, 2015, 2(4): 666-694. doi: 10.3934/biophy.2015.4.666
    [4] Warin Rangubpit, Pornthep Sompornpisut, R.B. Pandey . Thermal-induced unfolding-refolding of a nucleocapsid COVN protein. AIMS Biophysics, 2021, 8(1): 103-110. doi: 10.3934/biophy.2021007
    [5] James C. L. Chow . Recent progress in Monte Carlo simulation on gold nanoparticle radiosensitization. AIMS Biophysics, 2018, 5(4): 231-244. doi: 10.3934/biophy.2018.4.231
    [6] Masoud Jabbary, Hossein Rajabi . High-resolution phantom of a nephron for radiation nephrotoxicity evaluation in biophysical simulations. AIMS Biophysics, 2022, 9(2): 147-160. doi: 10.3934/biophy.2022013
    [7] Nily Dan . Bilayer degradation in reactive environments. AIMS Biophysics, 2017, 4(1): 33-42. doi: 10.3934/biophy.2017.1.33
    [8] Zaynah Sheeraz, James C.L. Chow . Evaluation of dose enhancement with gold nanoparticles in kilovoltage radiotherapy using the new EGS geometry library in Monte Carlo simulation. AIMS Biophysics, 2021, 8(4): 337-345. doi: 10.3934/biophy.2021027
    [9] Carl S. Helrich . Studies of cholesterol structures in phospholipid bilayers. AIMS Biophysics, 2017, 4(3): 415-437. doi: 10.3934/biophy.2017.3.415
    [10] Wei Zhang, Sheng Cao, Jessica L. Martin, Joachim D. Mueller, Louis M. Mansky . Morphology and ultrastructure of retrovirus particles. AIMS Biophysics, 2015, 2(3): 343-369. doi: 10.3934/biophy.2015.3.343
  • Computed tomography (CT) has been the most effective modality for characterizing and quantifying chronic obstructive pulmonary disease (COPD). Radiomics features extracted from the region of interest in chest CT images have been widely used for lung diseases, but they have not yet been extensively investigated for COPD. Therefore, it is necessary to understand COPD from the lung radiomics features and apply them for COPD diagnostic applications, such as COPD stage classification. Lung radiomics features are used for characterizing and classifying the COPD stage in this paper. First, 19 lung radiomics features are selected from 1316 lung radiomics features per subject by using Lasso. Second, the best performance classifier (multi-layer perceptron classifier, MLP classifier) is determined. Third, two lung radiomics combination features, Radiomics-FIRST and Radiomics-ALL, are constructed based on 19 selected lung radiomics features by using the proposed lung radiomics combination strategy for characterizing the COPD stage. Lastly, the 19 selected lung radiomics features with Radiomics-FIRST/Radiomics-ALL are used to classify the COPD stage based on the best performance classifier. The results show that the classification ability of lung radiomics features based on machine learning (ML) methods is better than that of the chest high-resolution CT (HRCT) images based on classic convolutional neural networks (CNNs). In addition, the classifier performance of the 19 lung radiomics features selected by Lasso is better than that of the 1316 lung radiomics features. The accuracy, precision, recall, F1-score and AUC of the MLP classifier with the 19 selected lung radiomics features and Radiomics-ALL were 0.83, 0.83, 0.83, 0.82 and 0.95, respectively. It is concluded that, for the chest HRCT images, compared to the classic CNN, the ML methods based on lung radiomics features are more suitable and interpretable for COPD classification. In addition, the proposed lung radiomics combination strategy for characterizing the COPD stage effectively improves the classifier performance by 12% overall (accuracy: 3%, precision: 3%, recall: 3%, F1-score: 2% and AUC: 1%).



    The outbreak of COVID 19 pandemic has caused an urgent need to understand a wide range of issues, both basic and applied, from epidemiology to diverse multi-scale molecular structures of its constitutive elements of the novel coronavirus (CoV-2) and its interaction with surroundings [1][4]. The RNA genome of CoV-2 is believed to encode only four major structural, sixteen non-structural and eleven accessory proteins. The four structural proteins including spike (S) protein, nucleocapsid (N) protein, membrane (M) protein, and the envelope (E) protein are key components for viral transmission and reproduction [4][9]. The CoVE protein is the smallest with 76 amino acid residues (1M2Y...76V) (for sequence see https://www.uniprot.org/uniprot/P59637) [10] but plays an essential role in replication cycle of the virus such as formation of envelope as an integral membrane protein in viral assembly, budding and pathogenesis [4][16].

    Very recently, Kuzmin et al. [17] have examined the conformational variability of CoVE protein in presence of a membrane environment using an all-atom MD along with a coarse-grained simulation. Because of ‘prohibitively long time’ with the atomistic simulation in ‘exhaustive sampling of the conformational space’ utility of coarse-grained approach implemented in this study is very well emphasized. They find that CoVE induces curvature in membrane and ‘it has a preferable position’ in already curved membranes with ‘the C-termini localize to the convex region’. Apart from locating specific segments of the protein from the membrane sites, there is no estimate of the overall size of the protein or the underlying dynamics. For example, how the segments of protein perform its stochastic moves and how the overall dynamics (diffusive, non-diffusive, drift etc.) of the protein (i.e. that of its center of mass) sets in as its segments settles in the curved region. How its conformation and dynamics depends on temperature. It may be useful to explore the structure and dynamics of a protein monomer free from membrane as a function of temperature first. Then incorporate the environment such as membrane and other constitutive elements such as solvent to assess its effect on the structure and dynamics. We would like to focus on the folding dynamics of a free CoVE protein, a purely theoretical exercise as an idealized model system, the characteristic of which may be useful in hypothesizing its role in interpreting the laboratory data. Such a basic investigation has not been performed to our knowledge as presented here.

    Sequence of CoVE protein and its specific characteristics such as formation of membrane ion channel, virulence intensity, etc. and other proteins are studied in depth [4][16]. Three distinct segments of the CoVE protein start with the hydrophilic amino (N) terminal with about 10 residues (1M–10G), followed by the hydrophobic transmembrane (TM) segment of about 25–27 residues (11T–37L), and ends with a relatively large hydrophilic carboxyl terminal consisting of the remaining residues (38R–76V). The structural details of how these segments interact, compete and cooperate is not fully known. The objective of this article is to investigate the effect of temperature on the structural dynamics by analyzing some of the local and global physical properties by a coarse-grained (CG) Monte Carlo simulation. The model and methods section is described in brief next followed by results and discussion and a conclusion.

    The model adopted here is based on an effective and efficient method involving bond-fluctuating mechanism on a discrete lattice with ample degrees of freedom to investigate complex issues in probing long time properties in polymer systems [18] that cannot be accessed by traditional MC or MD in a continuum space. This coarse-grained approach has been further developed over the years [19][21] with options to fine-grain and incorporate the specificity of amino acids to model proteins. The relevant detail has been already described to examine structural dynamics of a number of proteins [19][21]. In coarse-grained description, we constructed a CG protein model of CoVE containing a chain of 76 amino acid residues [10] in a specific sequence on a discrete cubic lattice with ample degrees of freedom to describe essential motion and fluctuation for each residue in the protein. A residue is modeled using a single CG node with a cubic node of size (2a)3 where a refers to the lattice constant. Consecutive nodes are tethered together by fluctuating covalent bonds of which the bond length between consecutive nodes varies between 2 and √(10) in unit of lattice constant (a). A generalized Lennard-Jones potential (Uij) is used to describe the interaction between each residue and surrounding residues within a range (rc) as:

    Uij=[|ϵij|(σγij)12+ϵij(σγij)6],rij<r

    where rij is the distance between the amino acid residues at site i and j; rc = √8 and σ = 1 in units of lattice constant. The potential strength eij in phenomenological interaction (1) is based on a knowledge-based [22] residue-residue contact matrix which has been developed over many years [23][27] from a huge (and rapidly growing) ensemble of protein structures in Protein Data Bank (PDB).

    The protein chain is placed in a random configuration initially since one can reach any microstate from any other in the vast landscape of statistical ensemble in thermodynamic equilibrium—a fundamental principle of statistical physics. Therefore, starting configuration is irrelevant. Each residue undergoes its stochastic movement with the Metropolis algorithm. The protein configurations are thus generated by movements of each residue with a probability proportional to the Boltzmann factor exp(–ΔE/T) where ΔE is the change in energy between new and old configurations of the underlying residues subject to strictly imposed excluded volume constraints. Monte Carlo time step is defined by attempts to move each residue once. Simulation parameters i.e. length in unit of lattice constant are measured in arbitrary unit and the temperature T in reduced units of the Boltzmann constant.

    It is important to point out that connecting the arbitrary units with the real sample size and temperature i.e. Kelvin, and time scale (micro-second) quantitatively is difficult due to phenomenological nature of the residue-residue interaction and the coarse-grained representation. Calibration of the parameters used in simulation requires measurements of some common physical quantities in both computer simulations as well as laboratory experiments which is not feasible. However, based on the approximate size of residue and its representation and response of the physical quantities with respect to changes in temperature, one can estimate the order of magnitude.

    All simulations reported here are performed on a 1503 cubic lattice for a range of temperature T = 0.010–0.030 with 100 independent samples each with 10 million time steps; different sample sizes are also used to make sure that the qualitative results are independent of finite size effects. Both local and global physical quantities such as radius of gyration (Rg), root mean square displacement (RMSD) of the center of mass, structure factor (S), contact map, etc. are investigated as a function of reduced temperature. In the following ‘temperature’ refers to ‘reduced temperature’ and should not be confused with the absolute value.

    Snapshots of the protein (CoVE) at representative temperatures (T = 0.020, 0.024, 0.030) presented in Figure 1 illustrate the conformational changes as it denatures on raising the temperature. Obviously the protein conforms to a globular (compact) structure at low temperature (T = 0.020) and expands as it denatures (T = 0.030). The segmental globularity in the transmembrane segment of the protein is retained at least in part even at the high temperature (see below). Note that snapshots do not represent the average conformation; it is an illustrative mean that needs to be quantified. Therefore, some quantitative measures of segmental globularity are highly desirable. Average number (Nn) of residues within the range of interaction of each along the contour of the protein may provide some insight into the local restructuring as a function of temperature.

    Figure 1.  Snapshot of CoVE protein at the end of 107 MCS time. Large black and grey sphere represents the first residue (1M) and the last residue (76V) of the protein, respectively. Smaller golden spheres represent the presence of other residues within the range of interaction excluding the consecutively connected residues. Backbone grey lines are covalent bonds connecting the residues. Snapshot at the left (T = 0.020), center (T = 0.024), right (T = 0.030) are at the representative temperatures to illustrate denaturing of the protein.
    Figure 2.  Residue contact profiles show average number (Nn) of surrounding residues around each residue of the protein within the range of interaction at a range of temperature (T = 0.012–0.024).

    The contact profiles at representative temperatures are shown in Figure 2. At low temperatures (T = 0.012–0.020, in native phase), the coagulating centers are distributed sporadically particularly in N- and C-terminal regions along the backbone (e.g. 1M2Y, 12L13I14V (in N-terminal), 51L52V53K, 56V57Y58V59Y, 74L75L76V (in C-terminus)) with relatively large Nn. The transmembrane segment (18L–44C), however, has the largest globular cluster with high number Nn of surrounding residues. Clearly, there is no significant change in contact profiles with the temperature in native phase. However, raising the temperature (T = 0.024) further leads to a substantial decrease in segmental globularity in both N- and C-terminals with low contact number Nn while retaining some globularity in TM segment with a relatively large Nn (see Figure 2). Thus, three distinct (N- and C-terminals separated by TM) segments can be identified easily from the evolution of contact profiles with the temperature.

    The global structure and dynamics of the protein emerge from the stochastic movement of each residues and their configurational stabilities. Variation of the root mean square displacement of the center of mass (Rc) with the time step (t) at a range of temperature presented in Figure 3 show, how the asymptotic dynamics depends on the temperature. One can identify the type of dynamics by estimating the power-law exponent (ν) by fitting the data with a scaling, Rc ∝ tν where ν = ½ represents diffusion. Accordingly, the protein is found to move extremely slowly at low temperatures with ultra-sub-diffusive dynamics (i.e. ν = 0.05 at T = 0.020) and resumes diffusion at higher temperatures (i.e. ν = 0.47 at T = 0.024, see Figure 3). Dynamics do affect the conformation of the protein which can be quantified by analyzing radius of gyration and the structure factor (see below).

    Figure 3.  Average root mean square displacement (RMSD) of the center of mass (Rc) of the protein with the time step (t) versus temperature in range , T = 0.012–0.030 on a log-log scale. Slopes of the fitted data (solid black lines) in the asymptotic regime at a low (T = 0.012) and a high (T = 0.024) temperature are included with corresponding estimates.

    How do we know that the protein chain has reached its steady-state equilibrium in 10 million time steps? It is important to check the convergence of physical quantities to make sure that system has reached steady-state within the range of simulation time steps in all such simulations. We have done here as in all our previous studies. For an illustration, variation of the radius of gyration with the time steps for a range of temperature is presented in Figure 4. We see that the radius of gyration has reached its steady-state equilibrium at almost all temperatures. It is not feasible to reach as good steady-state at low temperatures as is the case in most such interacting systems and even in laboratories. However, the probability of reaching the same asymptotic magnitude of Rg at all temperatures (T below 0.020) in sub-native phase (i.e. T = 0.018, 0.016 in Figure 4) is extremely low. Since it is not feasible to extend this simulation for such a long asymptotic time regime, it is worth issuing a caution to very cautious readers. The data on Rg in this sub-native phase (T = 0.010–0.020) in the following Figures should be therefore considered with caution.

    Figure 4.  Variation of the average radius of gyration (Rg) of the protein with the time step (t) for a temperature range, T = 0.016–0.030.

    The radius of gyration (Rg) of the protein is a measure of its average size. Variation of the average (over configurations from 5 million to 10 million time steps each with 100 independent samples) radius of gyration (Rg) with the temperature (T) is presented in Figure 5. In contrast to denature phase (T = 0.020–0.030) where the radius of gyration (Rg) increases on raising the temperature, it decays on increasing the temperature in the native phase (T = 0.010–0.020). Why the protein contract on raising the temperature in sub-native phase? At very low temperatures, segmental interactions dominate over thermal noise. On raising the temperature, the frozen or frustrated trapped segments begin to explore and equilibrate better and therefore result into a more compact conformation. This process continues until a certain temperature beyond which thermal noise take over the segmental interactions with the temperature and protein begins to expand. Such an opposite thermal response in native and denature phases is also observed in other membrane proteins in recent years [20]. Based on a crude calibration of the measurements of the physical quantities with the same coarse-grained approach and an all-atom MD simulation [20], we believe that the range of temperature T = 0.020–0.030 may lie in the range of T = 300–400 K roughly. In order to claim such thermal response, a universal or sub-universal characteristics for the membrane proteins, more studies on many different membrane proteins are needed. The thermal response of the nucleocapsid protein (CoVN) of COVID 19 is also non-monotonic [21] but very different from that of CoVE. We hope this results will stimulate further investigations in this direction.

    It is worth pointing out in passing that the validation of this model is based on the response of protein at extreme temperature where residue-residue interaction becomes irrelevant and conformation of a protein chain behaves similar to that of an athermal polymer chain. That is the case here as in all our previous work based on this coarse-grained model. Conformational dynamics of a range of proteins with various sizes are examined using this coarse-grained approach and observed different thermal response of proteins with comparable size. For example, proteins H3.1 and H2AX [19] are about the same size (136 and 140 residues) and exhibit very different thermal response to temperature: on raising the temperature, Rg of H3.1 increases continuously before saturation while Rg of H2AX increases to a maximum value followed by decay (a non-monotonic response different from the one observed here for CoVE). Thus, the specificity of residues and its distribution (i.e. sequence) in a protein pays a critical role in their collective response. Unfortunately, we do not have a data on a protein of the COVID related protein with the size comparable to CoVE to compare here. However, based on the analysis on many proteins (some cited in this study) with this approach, the conclusion on the thermal response of CoVE is reliable. A reader can always check it out by carry out simulation independently on an idealized model like the one presented here.

    Figure 5.  Average radius of gyration (Rg) of CoVE versus temperature.

    Analysis of the structure factor can also provide insight into the global conformation of the protein. The structure factor (S(q)) is defined as:

    S(q)=1N|Nj=1eiqrj|2|q|

    where rj is the position of each residue. The magnitude of the wave vector (q) of wavelength λ is given by |q| = 2π/λ. Using a power-law scaling for the structure factor with the wave vector, S(q) ∝ q–1/γ, one can evaluate the exponent γ. Variations of the structure factor with the wavelength (comparable to radius of gyration) is presented in Figure 5 for representative temperatures.

    In general, the radius of gyration (Rg) of a polymer chain shows a power-law scaling with number (N) of monomer (residues), Rg ∝ Nx with a well-defined exponent x. Since a protein is a hetero-polymer chain of residues, one may use the same scaling law here for the protein with the radius of gyration comparable to wavelength (Rg ~ λ). Then the power-law exponent x = γ and one may be able to estimate the globularity index g which is the effective dimension (D) for the spread of the protein (g ≅ D). Since, N ∝ RgD, D = 1/γ. Higher magnitude of D represents more compact structure with higher degree of globularity. Note that this scaling analysis is an estimate and should not be taken as a precise measurement. Obviously D cannot be higher than 3, however high value indicates more compact conformations. Clearly, the protein remains globular (three dimensional) at low temperatures in native phase and it denatures into a ramified random coil (D ~ 2) structures at high temperatures (see Figure 6). It is important to point out that the such a vast changes in conformation of the protein in sub-native i.e. low temperature (T = 0.010–0.020) to native-to-denature phase i.e. at higher temperature regime (T = 0.020–0.030) may pose the possibility of COVE to be an intrinsically disordered protein at least in some range of temperature and time scales. For example, the segmental interactions in the native phase may be somewhat similar to interaction of a protein with its underlying environment e.g. a specific surface [28] even in the absence of explicit external surface.

    Figure 6.  Structure factor (S) of the protein versus wave length (lambda) at representative low and high temperatures. Data points are selected with wavelength comparable to radius of gyration of the protein at corresponding temperatures with the estimates of slopes. The number along the axes are the range of the axes (starting and end points).

    Monte Carlo simulations are performed with a coarse-grained model of envelope membrane protein CoVE of the human coronavirus to assess its structural variability as a function of temperature. The evolution of the protein's conformation as a function of temperature is clearly seen in visualization via snapshots and animations. Segmental organizing of residues is quantified by examining local physical quantities such as average contact and mobility profiles as a function of temperature. From the variations in contact profiles as a function of temperature, it is feasible to identify three distinct segments (N-terminal and C-terminal separated by a transmembrane segment) which is consistent with the previous studies on CoVE [4],[5],[11][16]. The radius of gyration exhibits a non-monotonic thermal response: the protein CoVE contracts on heating in native phase and expands continuously on raising the temperature before reaching a saturation in denatured phase. Such a unique non-monotonic thermal response has been recently observed in other membrane proteins (unrelated to coronavirus). This may lead to speculate that thermal response may be a unique characteristic to identify different classes of proteins—obviously more studies are needed to confirm this observation. Thermal response of CoVE is different from another structural protein CoVN (a nucleocapsid protein) of the coronavirus [21]. Variation of the structure factor with the wave vector is studied in detail for a wide range of temperature. Scaling analysis of the structure factor provides a quantitative estimate of the overall globularity with an effective dimension (D). CoVE is found to remain globular (D ~ 3) in its native phase and unfolds continuously to a random coil (D ~ 2) conformation in denature phase on raising the temperature.



    [1] A. G. Mathioudakis, G. A. Mathioudakis, The phenotypes of chronic obstructive pulmonary disease, Arch. Hellenic Med., 31 (2014), 558-569. https://doi.org/10.1080/15412550701629663 doi: 10.1080/15412550701629663
    [2] GOLD 2022: Global initiative for chronic obstructive lung disease, 2022.
    [3] D. A. Suffredini, R. M. Reed, At the twisted heart of nicotine addiction, BMJ Case Rep., 2012. https://doi.org/10.1136/bcr-2012-006240 doi: 10.1136/bcr-2012-006240
    [4] P. W. Jones, Health status measurement in chronic obstructive pulmonary disease, Thorax, 56 (2001). https://doi.org/10.1201/9780203913406-14 doi: 10.1201/9780203913406-14
    [5] C. D. Brown, J. O. Benditt, F. C. Sciurba, S. M. Lee, G. J. Criner, Z. Mosenifar, et al., Exercise testing in severe emphysema: association with quality of life and lung function, COPD J. Chron. Obstruct. Pulm. Dis., 5 (2008), 117-124. https://doi.org/10.1080/15412550801941265 doi: 10.1080/15412550801941265
    [6] D. A. Lynch, Progress in Imaging COPD, 2004-2014, Chron. Obstruct. Pulm. Dis.: J. COPD Found., 1 (2014), 73-82. https://doi.org/10.15326/jcopdf.1.1.2014.0125 doi: 10.15326/jcopdf.1.1.2014.0125
    [7] P. J. Castaldi, R. S. J. Estépar, C. S. Mendoza, C. P. Hersh, N. Laird, J. D. Crapo, et al., Distinct quantitative computed tomography emphysema patterns are associated with physiology and function in smokers, Am. J. Respir. Crit. Care Med., 188 (2013), 1083-1090. https://doi.org/10.1164/rccm.201305-0873oc doi: 10.1164/rccm.201305-0873oc
    [8] T. B. Grydeland, A. Dirksen, H. O. Coxson, T. M. Eagan, E. Thorsen, S. G. Pillai, et al., Quantitative computed tomography measures of emphysema and airway wall thickness are related to respiratory symptoms, Am. J. Respir. Crit. Care Med., 181 (2010), 353-359. https://doi.org/10.1164/rccm.200907-1008oc doi: 10.1164/rccm.200907-1008oc
    [9] V. Kim, A. Davey, A. P. Comellas, M. K. Han, G. Washko, C. H. Martinez, et al., Clinical and computed tomographic predictors of chronic bronchitis in COPD: a cross Sectional analysis of the COPDGene study, Respir. Res., 15 (2014), 1-9. https://doi.org/10.1186/1465-9921-15-52 doi: 10.1186/1465-9921-15-52
    [10] S. P. Bhatt, N. L. Terry, H. Nath, J. A. Zach, J. Tschirren, M. S. Bolding, et al., Association between expiratory central airway collapse and respiratory outcomes among smokers, Jama, 315 (2016), 498-505. https://doi.org/10.1164/rccm.202008-3122le doi: 10.1164/rccm.202008-3122le
    [11] C. P. Hersh, G. R. Washko, R. S. J. Estépar, S. Lutz, P. J. Friedman, M. K. Han, et al., Paired inspiratory-expiratory chest CT scans to assess for small airways disease in COPD, Respir. Res., 14 (2013), 1-11. https://doi.org/10.1164/ajrccm-conference.2012.185.1_meetingabstracts.a6539 doi: 10.1164/ajrccm-conference.2012.185.1_meetingabstracts.a6539
    [12] S. Bodduluri, J. M. Reinhardt, E. A. Hoffman, J. D. Newell Jr, H. Nath, M. T. Dransfield, et al., Signs of gas trapping in normal lung density regions in smokers, Am. J. Respir. Crit. Care Med., 196 (2017), 1404-1410. https://doi.org/10.1164/rccm.201705-0855oc doi: 10.1164/rccm.201705-0855oc
    [13] C. J. Galbán, M. K. Han, J. L. Boes, K. A. Chughtai, C. R. Meyer, T. D. Johnson, et al. Computed tomography-based biomarker provides unique signature for diagnosis of COPD phenotypes and disease progression, Nat. Med., 18 (2012), 1711-1715. https://doi.org/10.1038/nm.2971 doi: 10.1038/nm.2971
    [14] S. Bodduluri, S. P. Bhatt, E. A. Hoffman, J. D. Newell, C. H. Martinez, M. T. Dransfield, et al., Biomechanical CT metrics are associated with patient outcomes in COPD, Thorax, 72 (2017), 409-414. https://doi.org/10.1136/thoraxjnl-2016-209544 doi: 10.1136/thoraxjnl-2016-209544
    [15] S. P. Bhatt, S. Bodduluri, E. A. Hoffman, J. D. Newell Jr, J. C. Sieren, M. T. Dransfield, et al., Computed tomography measure of lung at risk and lung function decline in chronic obstructive pulmonary disease, Am. J. Respir. Crit. Care Med., 196 (2017), 569-576. https://doi.org/10.1164/rccm.201701-0050oc doi: 10.1164/rccm.201701-0050oc
    [16] G. R. Washko, G. L. Kinney, J. C. Ross, R. S. J. Estépar, M. K. Han, M. T. Dransfield, et al., Lung Mass in Smokers, Acad. Radiol., 24 (2016), 386-392. https://doi.org/10.1016/j.acra.2016.10.011 doi: 10.1016/j.acra.2016.10.011
    [17] J. M. Wells, G. R. Washko, M. K. Han, N. Abbas, H. Nath, A. J. Mamary, et al., Pulmonary arterial enlargement and acute exacerbations of COPD, N. Engl. J. Med., 367 (2012), 913-921. https://doi.org/10.1136/thoraxjnl-2013-203397 doi: 10.1136/thoraxjnl-2013-203397
    [18] R. S. J. Estépar, G. L. Kinney, J. L. Black-Shinn, R. P. Bowler, G. L. Kindlmann, J. C. Ross, et al., Computed tomographic measures of pulmonary vascular morphology in smokers and their clinical implications, Am. J. Respir. Crit. Care Med., 188 (2013), 231-239. https://doi.org/10.1164/rccm.201301-0162oc doi: 10.1164/rccm.201301-0162oc
    [19] P. Lambin, E. Rios-Velazquez, R. Leijenaar, S. Carvalho, R. G. Van Stiphout, P. Granton, et al., Radiomics: Extracting more information from medical images using advanced feature analysis, Eur. J. Cancer, 43 (2007), 441-446. https://doi.org/10.1016/j.ejca.2011.11.036 doi: 10.1016/j.ejca.2011.11.036
    [20] A. N. Frix, F. Cousin, T. Refaee, F. Bottari, A. Vaidyanathan, C. Desir, et al., Radiomics in lung diseases imaging: State-of-the-art for clinicians, J. Pers. Med., 11 (2021), 1-20. https://doi.org/10.3390/jpm11070602 doi: 10.3390/jpm11070602
    [21] S. M. Rezaeijo, R. Abedi-Firouzjah, M. Ghorvei, S. Sarnameh, Screening of COVID-19 based on the extracted radiomics features from chest CT images, J. X-Ray Sci. Technol., 29 (2021), 1-15. https://doi.org/10.3233/xst-200831 doi: 10.3233/xst-200831
    [22] F. Xiao, R. Sun, W. Sun, D. Xu, L. Lan, H. Li, et al., Radiomics analysis of chest CT to predict the overall survival for the severe patients of COVID-19 pneumonia, Phys. Med. Biol., 66 (2021), 1-11. https://doi.org/10.1088/1361-6560/abf717 doi: 10.1088/1361-6560/abf717
    [23] F. Xiong, Y. Wang, T. You, H. Li, T. Fu, H. Tan, et al., The clinical classification of patients with COVID-19 pneumonia was predicted by Radiomics using chest CT, Medicine, 100 (2021), 1-8. https://doi.org/10.1097/md.0000000000025307 doi: 10.1097/md.0000000000025307
    [24] M. Tamal, M. Alshammari, M. Alabdullah, R. Hourani, H. A. Alola, T. M. Hegazi, An integrated framework with machine learning and radiomics for accurate and rapid early diagnosis of COVID-19 from chest x-ray, Expert Syst. Appl., 180 (2021), 1-8. https://doi.org/10.1101/2020.10.01.20205146 doi: 10.1101/2020.10.01.20205146
    [25] Y. Tang, T. Zhang, X. Zhou, Y. Zhao, H. Xu, Y. Liu, et al., The preoperative prognostic value of the radiomics nomogram based on CT combined with machine learning in patients with intrahepatic cholangiocarcinoma, World J. Surg. Oncol., 19 (2021), 1-13. https://doi.org/10.1186/s12957-021-02162-0 doi: 10.1186/s12957-021-02162-0
    [26] X. Han, J. Yang, J. Luo, P. Chen, Z. Zhang, A. Alu, et al., Application of CT-based radiomics in discriminating pancreatic cystadenomas from pancreatic neuroendocrine tumors using machine learning methods, Front. Oncol., 11 (2021), 1-13. https://doi.org/10.3389/fonc.2021.606677 doi: 10.3389/fonc.2021.606677
    [27] M. F. A. Chaudhary, E. A. Hoffman, A. P. Comellas, J. Guo, S. Fortis, S. Bodduluri, et al., CT texture features predict severe COPD exacerbations in spiromics, in American Thoracic Society 2021 International Conference, (2021), 1122-1122. https://doi.org/10.1164/ajrccm-conference.2021.203.1_meetingabstracts.a1122
    [28] M. Occhipinti, M. Paoletti, B. J. Bartholmai, S. Rajagopalan, R. A. Karwoski, C. Nardi, et al., Spirometric assessment of emphysema presence and severity as measured by quantitative CT and CT-based radiomics in COPD, Respir. Res., 20 (2019), 1-11. https://doi.org/10.1186/s12931-019-1049-3 doi: 10.1186/s12931-019-1049-3
    [29] G. Wu, A. Ibrahim, I. Halilaj, R. T. Leijenaar, W. Rogers, H. A. Gietema, et al., The emerging role of radiomics in COPD and lung cancer, Respiration, 99 (2020), 99-107. https://doi.org/10.1159/000505429 doi: 10.1159/000505429
    [30] G. Maragatham, S. Rajendran, Improving the classifier accuracy with an integrated approach using medical data-a study, Int. J. Med. Eng. Inf., 12 (2020), 313-321. https://doi.org/10.1504/ijmei.2020.10029317 doi: 10.1504/ijmei.2020.10029317
    [31] D. Lu, Q. Weng, A survey of image classification methods and techniques for improving classification performance, Int. J. Remote Sens., 28 (2007), 823-870. https://doi.org/10.1080/01431160600746456 doi: 10.1080/01431160600746456
    [32] R. C. Au, W. C. Tan, J. Bourbeau, J. C. Hogg, M. Kirby, Impact of image pre-processing methods on computed tomography radiomics features in chronic obstructive pulmonary disease, Phys. Med. Biol., 66 (2021). https://doi.org/10.2139/ssrn.3349696 doi: 10.2139/ssrn.3349696
    [33] J. Yun, Y. H. Cho, S. M. Lee, J. Hwang, J. S. Lee, Y. M. Oh, et al., Deep radiomics-based survival prediction in patients with chronic obstructive pulmonary disease, Sci. Rep., 11 (2021), 1-9. https://doi.org/10.1038/s41598-021-94535-4 doi: 10.1038/s41598-021-94535-4
    [34] R. C. Au, W. C. Tan, J. Bourbeau, J. C. Hogg, M. Kirby, Radiomics Analysis to Predict Presence of Chronic Obstructive Pulmonary Disease and Symptoms Using Machine Learning, in TP121 COPD: FROM CELLS TO THE CLINIC, American Thoracic Society, 2021. https://doi.org/10.1164/ajrccm-conference.2021.203.1_meetingabstracts.a4568
    [35] C. Liang, J. Xu, F. Wang, H. Chen, J. Tang, D. Chen, et al., Development of a radiomics model for predicting COPD exacerbations based on complementary visual information, in TP041 DIAGNOSIS AND RISK ASSESSMENT IN COPD, American Thoracic Society, 2021. https://doi.org/10.1164/ajrccm-conference.2021.203.1_meetingabstracts.a2296
    [36] Y. Yang, W. Li, Y. Guo, Y. Liu, Q. Li, K. Yang, et al., Early COPD risk decision for adults aged from 40 to 79 years based on lung radiomics features, Front. Med., 9 (2022), 1-15. https://doi.org/10.3389/fmed.2022.845286 doi: 10.3389/fmed.2022.845286
    [37] Y. Yang, W. Li, Y. Kang, Y. Guo, K. Yang, Q. Li, et al., A novel lung radiomics feature for characterizing resting heart rate and COPD stage evolution based on radiomics feature combination strategy, Math. Biosci. Eng., 19 (2022), 4145-4165. https://doi.org/10.3934/mbe.2022191 doi: 10.3934/mbe.2022191
    [38] Y. Zhou, P. L. Bruijnzeel, C. McCrae, J. Zheng, U. Nihlen, R. Zhou, et al., Study on risk factors and phenotypes of acute exacerbations of chronic obstructive pulmonary disease in Guangzhou, China-design and baseline characteristics, J. Thorac. Dis., 7 (2015), 720-733. https://doi:10.3978/j.issn.2072-1439.2015.04.14 doi: 10.3978/j.issn.2072-1439.2015.04.14
    [39] J. Hofmanninger, F. Prayer, J. Pan, S. Rohrich, H. Prosch, G. Langs, Automatic lung segmentation in routine imaging is a data diversity problem, not a methodology problem, Eur. Radiol. Exp., 4 (2020), 1-13. https://doi.org/10.1186/s41747-020-00173-2 doi: 10.1186/s41747-020-00173-2
    [40] Y. Yang, Q. Li, Y. Guo, Y. Liu, X. Li, J. Guo, et al., Lung parenchyma parameters measure of rats from pulmonary window computed tomography images based on ResU-Net model for medical respiratory researches, Math. Biosci. Eng., 18 (2021), 4193-4211. https://doi.org/10.3934/mbe.2021210 doi: 10.3934/mbe.2021210
    [41] Y. Yang, Y. Guo, J. Guo, Y. Gao, Y. Kang, A method of abstracting single pulmonary lobe from computed tomography pulmonary images for locating COPD, in Proceedings of the Fourth International Conference on Biological Information and Biomedical Engineering, (2020), 1-6. https://doi.org/10.1145/3403782.3403805
    [42] J. J. M. van Griethuysen, A. Fedorov, C. Parmar, A. Hosny, N. Aucoin, V. Narayan, et al., Computational radiomics system to decode the radiographic phenotype, Cancer Res., 77 (2017), 104-107. https://doi.org/10.1158/0008-5472.can-17-0339 doi: 10.1158/0008-5472.can-17-0339
    [43] R. Tibshirani, Regression shrinkage and selection via the Lasso, J. R. Stat. Soc. B, 58 (2007), 267-288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x doi: 10.1111/j.2517-6161.1996.tb02080.x
    [44] N. Simon, J. Friedman, T. Hastie, R. Tibshirani, Regularization paths for Cox's proportional hazards model via coordinate descent, J. Stat. Software, 39 (2011), 1-13. https://doi.org/10.18637/jss.v039.i05 doi: 10.18637/jss.v039.i05
    [45] Y. Qi, Random forest for bioinformatics, in Ensemble machine learning: methods and applications, Springer, Boston, MA, (2012), 307-323. https://doi.org/10.1007/978-1-4419-9326-7_11
    [46] T. H. Kim, D. C. Park, D. M. Woo, T. Jeong, S. Y. Min, Multi-class classifier-based adaboost algorithm, in International conference on intelligent science and intelligent data engineering, Springer, Berlin, Heidelberg, (2011), 122-127. https://doi.org/10.1007/978-3-642-31919-8_16
    [47] V. K. Ayyadevara, Gradient boosting machine, in Pro machine learning algorithms, Apress, Berkeley, CA, (2018), 117-134. https://doi.org/10.1007/978-1-4842-3564-5_6
    [48] M. Taki, A. Rohani, F. Soheili-Fard, A. Abdeshahi, Assessment of energy consumption and modeling of output energy for wheat production by neural network (MLP and RBF) and Gaussian process regression (GPR) models, J. Cleaner Prod. 172 (2018), 3028-3041. https://doi.org/10.1016/j.jclepro.2017.11.107 doi: 10.1016/j.jclepro.2017.11.107
    [49] W. Hu, W. Hu, S. Maybank, Adaboost-based algorithm for network intrusion detection, IEEE Trans. Syst. Man Cybern. Part B Cybern., 38 (2008), 577-583. https://doi.org/10.1109/tsmcb.2007.914695 doi: 10.1109/tsmcb.2007.914695
    [50] S. Suthaharan, Support vector machine, in Machine learning models and algorithms for big data classification, Springer, Boston, MA, (2016), 207-235. https://doi.org/10.1007/978-1-4899-7641-3_9
    [51] Q. Li, Y. Yang, Y. Guo, W. Li, Y. Liu, H. Liu, et al., Performance evaluation of deep learning classification network for image features, IEEE Access, 9 (2021), 9318-9333. https://doi.org/10.1109/access.2020.3048956 doi: 10.1109/access.2020.3048956
    [52] M. A. Carbonneau, V. Cheplygina, E. Granger, G. Gagnon, Multiple instance learning: A survey of problem characteristics and applications, Pattern Recognit., 77 (2018), 329-353. https://doi.org/10.1016/j.patcog.2017.10.009 doi: 10.1016/j.patcog.2017.10.009
    [53] H. Polat, H. D. Mehr, Classification of pulmonary CT images by using hybrid 3D-deep convolutional neural network architecture, Appl. Sci., 9 (2019), 1-15. https://doi.org/10.3390/app9050940 doi: 10.3390/app9050940
    [54] A. Chon, N. Balachandar, P. Lu, Deep convolutional neural networks for lung cancer detection, Standford Univ., (2017), 1-9. https://doi.org/10.1109/uemcon47517.2019.8993023 doi: 10.1109/uemcon47517.2019.8993023
    [55] S. P. Singh, L. Wang, S. Gupta, H. Goli, P. Padmanabhan, B. Gulyás, 3D deep learning on medical images: a review, Sensors, 20 (2020), 1-24. https://doi.org/10.3390/s20185097 doi: 10.3390/s20185097
    [56] B. H. Lee, D. H. Oh, T. Y. Kim, 3D Virtual reality game with deep learning-based hand gesture recognition, J. Korea Comput. Graphics Soc., 24 (2018), 41-48. https://doi.org/10.15701/kcgs.2018.24.5.41 doi: 10.15701/kcgs.2018.24.5.41
    [57] B. George, S. Seals, I. Aban, Survival analysis and regression models, J. Nucl. Cardiol., 21 (2014), 686-694. https://doi.org/10.1007/s12350-014-9908-2 doi: 10.1007/s12350-014-9908-2
    [58] L. Torrey, J. Shavlik, Transfer learning, in Handbook of research on machine learning applications and trends: algorithms, methods, and techniques, IGI global, (2010), 242-264. https://doi.org/10.4018/978-1-60566-766-9.ch011
  • Reader Comments
  • © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4355) PDF downloads(292) Cited by(15)

Figures and Tables

Figures(15)  /  Tables(12)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog