Processing math: 100%
Research article Special Issues

Biomedical image segmentation algorithm based on dense atrous convolution

  • Received: 19 November 2023 Revised: 15 February 2024 Accepted: 20 February 2024 Published: 26 February 2024
  • Biomedical images have complex tissue structures, and there are great differences between images of the same part of different individuals. Although deep learning methods have made some progress in automatic segmentation of biomedical images, the segmentation accuracy is relatively low for biomedical images with significant changes in segmentation targets, and there are also problems of missegmentation and missed segmentation. To address these challenges, we proposed a biomedical image segmentation method based on dense atrous convolution. First, we added a dense atrous convolution module (DAC) between the encoding and decoding paths of the U-Net network. This module was based on the inception structure and atrous convolution design, which can effectively capture multi-scale features of images. Second, we introduced a dense residual pooling module to detect multi-scale features in images by connecting residual pooling blocks of different sizes. Finally, in the decoding part of the network, we adopted an attention mechanism to suppress background interference by enhancing the weight of the target area. These modules work together to improve the accuracy and robustness of biomedical image segmentation. The experimental results showed that compared to mainstream segmentation networks, our segmentation model exhibited stronger segmentation ability when processing biomedical images with multiple-shaped targets. At the same time, this model can significantly reduce the phenomenon of missed segmentation and missegmentation, improve segmentation accuracy, and make the segmentation results closer to the real situation.

    Citation: Hong'an Li, Man Liu, Jiangwen Fan, Qingfang Liu. Biomedical image segmentation algorithm based on dense atrous convolution[J]. Mathematical Biosciences and Engineering, 2024, 21(3): 4351-4369. doi: 10.3934/mbe.2024192

    Related Papers:

    [1] Yiwen Tao, Zhenqiang Zhang, Bengbeng Wang, Jingli Ren . Motality prediction of ICU rheumatic heart disease with imbalanced data based on machine learning. Big Data and Information Analytics, 2024, 8(0): 43-64. doi: 10.3934/bdia.2024003
    [2] Cai-Tong Yue, Jing Liang, Bo-Fei Lang, Bo-Yang Qu . Two-hidden-layer extreme learning machine based wrist vein recognition system. Big Data and Information Analytics, 2017, 2(1): 59-68. doi: 10.3934/bdia.2017008
    [3] Tieliang Gong, Qian Zhao, Deyu Meng, Zongben Xu . Why Curriculum Learning & Self-paced Learning Work in Big/Noisy Data: A Theoretical Perspective. Big Data and Information Analytics, 2016, 1(1): 111-127. doi: 10.3934/bdia.2016.1.111
    [4] M Supriya, AJ Deepa . Machine learning approach on healthcare big data: a review. Big Data and Information Analytics, 2020, 5(1): 58-75. doi: 10.3934/bdia.2020005
    [5] Maria Gabriella Mosquera, Vlado Keselj . Identifying electronic gaming machine gambling personae through unsupervised session classification. Big Data and Information Analytics, 2017, 2(2): 141-175. doi: 10.3934/bdia.2017015
    [6] Bill Huajian Yang . Resolutions to flip-over credit risk and beyond-least squares estimates and maximum likelihood estimates with monotonic constraints. Big Data and Information Analytics, 2018, 3(2): 54-67. doi: 10.3934/bdia.2018007
    [7] Bill Huajian Yang, Jenny Yang, Haoji Yang . Modeling portfolio loss by interval distributions. Big Data and Information Analytics, 2020, 5(1): 1-13. doi: 10.3934/bdia.2020001
    [8] Ricky Fok, Agnieszka Lasek, Jiye Li, Aijun An . Modeling daily guest count prediction. Big Data and Information Analytics, 2016, 1(4): 299-308. doi: 10.3934/bdia.2016012
    [9] Bill Huajian Yang . Modeling path-dependent state transitions by a recurrent neural network. Big Data and Information Analytics, 2022, 7(0): 1-12. doi: 10.3934/bdia.2022001
    [10] Marco Tosato, Jianhong Wu . An application of PART to the Football Manager data for players clusters analyses to inform club team formation. Big Data and Information Analytics, 2018, 3(1): 43-54. doi: 10.3934/bdia.2018002
  • Biomedical images have complex tissue structures, and there are great differences between images of the same part of different individuals. Although deep learning methods have made some progress in automatic segmentation of biomedical images, the segmentation accuracy is relatively low for biomedical images with significant changes in segmentation targets, and there are also problems of missegmentation and missed segmentation. To address these challenges, we proposed a biomedical image segmentation method based on dense atrous convolution. First, we added a dense atrous convolution module (DAC) between the encoding and decoding paths of the U-Net network. This module was based on the inception structure and atrous convolution design, which can effectively capture multi-scale features of images. Second, we introduced a dense residual pooling module to detect multi-scale features in images by connecting residual pooling blocks of different sizes. Finally, in the decoding part of the network, we adopted an attention mechanism to suppress background interference by enhancing the weight of the target area. These modules work together to improve the accuracy and robustness of biomedical image segmentation. The experimental results showed that compared to mainstream segmentation networks, our segmentation model exhibited stronger segmentation ability when processing biomedical images with multiple-shaped targets. At the same time, this model can significantly reduce the phenomenon of missed segmentation and missegmentation, improve segmentation accuracy, and make the segmentation results closer to the real situation.



    Sepsis [1] is defined as a life-threatening clinical syndrome characterized by organ dysfunction due to an imbalanced bodily response to infection. Each year in the United States (US), severe sepsis and septic shock [2] affects 750,000 patients, with a mortality rate of up to 30%. Over half of these patients are admitted to the intensive care unit (ICU), accounting for 10% of all ICU deaths and 20% to 30% of all hospital deaths, resulting in annual healthcare costs of $15.4 billion within the US [3,4,5]. Many nations have analyzed and projected the sepsis mortality rates in their respective countries. For instance, Weng et al. [6] conducted a study on sepsis in China, and revealed significant regional disparities in sepsis-related deaths. Their multilevel analysis indicated that the male gender, advancing age, and comorbidities were independently associated with increased sepsis-related mortality. Fleischmann et al. [7] investigated sepsis mortality in Germany, and found that infection and sepsis were significant causes of hospitalization and death. Prest et al. [8] examined sepsis deaths in the US from 2005 to 2018, and observed that mortality is influenced by race and geographic location. Furthermore, even after discharge, sepsis patients are prone to complications and sequelae that can compromise their health and elevate their 90-day mortality risk. Consequently, sepsis can be regarded as a global public health concern. Given the high mortality rate and economic burden associated with sepsis, timely and effective risk assessments and treatments have become particularly crucial.

    A timely sepsis treatment necessitates an early risk assessment [9,10,11] of the patient, highlighting the importance of an effective scoring system for clinicians to identify high-risk patients and to estimate the likelihood of death. Various severity scores [12] are widely used in ICUs to assess the disease severity and prognostic outcomes. However, when it comes to diagnosing sepsis, the clinical application of these scores remains limited. This is due to the complex pathogenesis of sepsis and the inadequacy of relying on a single scoring system. For instance, the acute physiology and chronic health evaluation II (APACHE II) score [13] underestimates the risk of death in sepsis patients in the ICU. Similarly, the simplified acute physiological scoring II (SAPS II) [14] has proven ineffective in external validation studies, further emphasizing the need for more accurate and comprehensive assessment tools in sepsis diagnoses.

    Given these limitations, the exploration of new predictive models, particularly those based on machine learning, has become imperative. Numerous researchers have developed sepsis mortality prediction models that leverage machine learning algorithms [15,16,17,18]. These models tend to focus on patients with severe sepsis and septic shock, though they exhibit a poor specificity for sepsis patients. William et al. [19] constructed a mortality risk model that utilized laboratory data and clinical indicators, and demonstrated the feasibility of assessing the sepsis patients' death risk. Nevertheless, most of these models employ either traditional machine learning or unsupervised learning algorithms, which often lack interpretability. To address this issue, we have established a sepsis mortality risk scoring card inspired by financial scoring cards. In this scoring card, each feature is assigned a score within a specific range, and the final death risk is determined by summing these scores, providing a more interpretable and comprehensive assessment tool for sepsis diagnosis.

    In this paper, drawing inspiration from financial scoring cards, we develop a sepsis mortality risk scoring card that leverages machine learning techniques, specifically a logistic regression, to identify high-risk sepsis patients upon admission to the ICU. By incorporating easily accessible clinical and laboratory data routinely collected within the first 24 hours of ICU admission, our proposed scoring card aims to provide clinicians with a comprehensive and concise tool to promptly assess the sepsis patients' mortality risk and guiding early intervention to ultimately improve the patient outcomes.

    Our method to predict the sepsis mortality risk was modeled and tested on the electronic health record (EHR) data from the Medical Information Mart for Intensive Care Ⅲ (MIMIC Ⅲ) database, version 1.4 [20]. We have obtained ethics approval (registration code: 9641799), which granted us access to use this data. From the MIMIC Ⅲ database, we extracted information on patients admitted to the ICU with sepsis, diagnosed according to the Sepsis 3.0 criteria—specifically, those with suspected infection and a sequential organ failure assessment (SOFA) score of 2 or higher. The primary outcome measure was whether these sepsis patients succumbed within 31 days. The label indicated whether the sepsis patients succumbed within 31 days, where "1" denoted patient survival, and "0" signified death.

    We excluded patients based on the following criteria:

    1)  Non-adults (aged 12 years or younger);

    2)  Patients with multiple hospital admissions;

    3)  Patients whose infection occurred within 24 hours prior to ICU admission; and

    4)  Patients whose infection occurred more than 24 hours after ICU admission.

    After identifying the study population, we subsequently extracted relevant physiological and laboratory data from the MIMIC Ⅲ database.

    In this section, we made use of the structured query language (SQL) to pinpoint sepsis patients who met the diagnostic criteria set forth by sepsis 3.0. Following this identification, we proceeded to collect the physiological and laboratory data from the first 24 hours of their ICU admission. The collected basic information encompassed details such as age, sex, race, and admission_type.

    The vital signs collected included the following: heart rate (HR), respiration rate (RR), systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial pressure (MAP), body temperature (Tem), and oxygen saturation (Spo2).

    The laboratory data collected included the following: white blood cell count (WBC), blood lactic acid (Lactate), potassium, serum creatinine (Creatinine), Platelets, Blood Urea Nitrogen (BUN), Sodium Ions (Sodium), chloride ions (Chloride), glucose, international standardized ratio (INR), hemoglobin, hematocrit, and anion space (Anion).

    The traditional severity scores include: Oxford acute illness severity score (OASIS), logistic organ dysfunction system (LODS), sequential organ failure assessment (SOFA), system inflammatory reaction syndrome (SIRS), and quick SOFA (qSOFA).

    All variables were recorded during the first 24h of ICU admission.

    After collecting the data, necessary preprocessing was required due to potential issues such as missing values, outliers, and correlations among features that existed in the raw data. When extracted data remains unprocessed, it frequently gives rise to several issues. For instance, disparities in dimensions hinder direct comparisons. The presence of extreme values and outliers within the dataset diminishes its usability. The correlation among features often results in redundancies, thereby undermining the model's efficacy. Moreover, missing values in the data necessitate varying approaches for either imputation or deletion. Consequently, feature engineering becomes imperative to enhance data utilization and the model accuracy.

    We need to preprocess the data with a weight of evidence (WOE) encoding. It is a recoding method applied to independent variables. To perform WOE coding on the variables, we first need to use chi-square boxes to group the variables. Chi-square binning is a bottom-up (i.e., pooling-based) method of data discretization that relies on a chi-square test: the adjacent bins with the smallest chi-square values are pooled together until a certain stopping condition is met. After grouping, for the i-th group, the calculation formula of the WOE is as follows:

    WOEi=ln(pyipni)=ln(#yi/#yT#ni/#nT)=ln(#yi/#ni#yT/#nT) (1)

    Among them, pyi represents the proportion of surviving patients in a particular group relative to all surviving patients in the entire sample (analogous to surviving patients in a medical context). Similarly, pni signifies the proportion of deceased patients in that group compared to all deceased patients in the sample. The variables #yi, #ni, #yT, and #nT denote the number of surviving patients in the group, the number of deceased patients in the group, the total number of surviving patients in the entire sample, and the total number of deceased patients in the entire sample, respectively. The WOE embodies the ratio of surviving to deceased patients in the present group, juxtaposed with this ratio across the entire sample. A higher WOE value indicates a more significant disparity, suggesting that the specific group exerts a greater influence on the dependent variables.

    In the binary classification problem of machine learning, the information value (Ⅳ) primarily serves to encode the input variables and to assess their predictive capability. The Ⅳ of a characteristic variable signifies its ability to predict, and its computation relies on the WOE:

    IVi=(pyipni)×WOEi (2)

    The Ⅳ value of the variable is obtained by adding the Ⅳs of each group, IV=IVi. The calculation of the Ⅳ value is the WOE multiplication coefficient, and this coefficient takes the proportion of the samples in this group to the overall sample into account. The lower the proportion, the lower the contribution of this group is to the overall predictive ability of the variable. Therefore, a reasonable indicator to judge the predictive ability can be obtained. By applying WOE encoding and the Ⅳ value calculation, we can effectively address the extreme values and redundant features within the data, thereby laying a solid foundation for the subsequent model development.

    In this paper, we used the Python programming language (version 3.7.0) for data processing. The main packages used are numpy (version 1.19.2) and pandas (version 1.1.5).

    Our proposed predictive model uses the individual patient data available within 24 hours of one admission to the ICU and generates the probability of death within 31 days. This prediction task can be solved by various statistical and machine learning models. In the current study, we evaluated a logistic regression, XGBoost, K-Nearest neighbor (KNN), decision tree, and support vector machines on the laboratory datasets. XGBoost is the latest implementation of a gradient tree enhancement system, which includes combining many "weak" decision tree predictors into a single powerful predictor. In view of the strong interpretability of the logistic regression algorithm, we used it to construct the sepsis death risk score card. In addition, to prevent over-fitting, we used a hierarchical five-fold-cross validation to evaluate the generalize ability of our prediction model. The main evaluation indexes of the model were accuracy, area under the curve (AUC), specificity, and sensitivity. Machine learning training and evaluation were performed in Python, using the sklearn (version 0.24.1) and scorecardpy (version 1.1.9.2) packages. The analysis code for this study is available on reasonable request.

    The sample with multiple missing values was deleted, and the variables with missing data less than 5% were filled with an average interpolation. Then, Pearson's correlation was calculated for the continuous variables. A correlation analysis found that the correlation between hemoglobin and hematocrit (0.96), chloride and sodium (0.70), and the BUN and creatinine (0.68) variables were high. The two variables with high correlations were deleted according to the Ⅳ value (the one with the higher Ⅳ value is retained). Figure 1 illustrates the correlation among various features, providing a visual representation of the interrelationships between different characteristics within the dataset: the darker the color, the higher the correlation. The Ⅳ value can be used to describe the predictive ability of the variables, and the relationship between the Ⅳ value and the predictive ability is as follows: Ⅳ value <= 0.02, no predictive ability; 0.02 <= Ⅳ value <= 0.1, weak predictive ability; 0.1 <= Ⅳ value <= 0.3, moderate predictive ability; 0.3 <= Ⅳ value <= 0.5, strong predictive ability; and 0.5 <= Ⅳ value <= 1, doubtful. Finally, we selected 9 variables with an Ⅳ value greater than 0.1 as the input variables.

    Figure 1.  The correlation among various features.

    The variables that ultimately entered the model and demonstrated strong predictive capabilities for labels include Anion, INR, RR, BUN, HR, admission_type, lactate, age, and Tem.

    A preliminary study conducted on the MIMIC Ⅲ database identified 37,721 hospitalized patients who had only one ICU admission. Among these, 30,075 patients aged 12 years or older were selected for further analysis. However, 5004 patients were excluded due to infections that occurred either too early or too late in their hospital stay, 12,616 patients did not meet the sepsis 3.0 criteria, and another 4536 patients were excluded due to incomplete basic data. This resulted in a final cohort of 7919 patients, among whom 1665 died, yielding a mortality rate of 20%. Table 1 provides an overview of the demographic and clinical characteristics of all patients included in the analysis, grouped by survivors and non-survivors. For the continuous variables, the mean (± standard deviation) is presented, while for the categorical variables, the number and proportion are shown.

    Table 1.  The basic information of patients.
    Survivors (n = 6254) Non–survivors (n = 1665)
    Demographic
    Age (yr) 63.5 (45.8–81.2) 69.5 (52.4–86.3)
    Admission_type (n, %)
    Elective 1079 (17.3) 31 (1.9)
    Urgent 5087 (81.3) 1604 (96.3)
    Emergency 88 (1.4) 30 (1.8)
    Basic Vital Signs
    Heart Rate 103.1 (83.1–123.0) 111.6 (87.1–135.9)
    Respiratory Rate 26.0 (19.4–32.8) 29.2 (22.0–36.3)
    Temperature 37.5 (36.7–38.4) 37.2 (36.1–38.5)
    Laboratory Indicators
    Lactate 3.0 (0.8–5.2) 4.9 (0.9–9.0)
    INR 1.6 (0.2–3.0) 2.2 (0.1–4.3)
    Anion 15.8 (11.1–20.6) 19.6 (13.2–26.0)
    BUN 17.7 (5.5–50.2) 40.7 (12.8–68.9)
    Severity Score
    SOFA 5 (3–8) 8 (4–12)
    SIRS 2 (1–4) 3 (2–4)
    OASIS 34 (26–42) 43 (34–52)
    LODS 5 (3–7) 8 (5–11)
    qSOFA 2 (1–8) 2 (1–5)

     | Show Table
    DownLoad: CSV

    The distribution of our variables is significantly skewed. Introducing these variables into the model without any preprocessing would result in information dispersion, thereby hindering an accurate prediction of the mortality risk. To visualize this skewness, we have crafted several variable box plots, which are exhibited in Figure 2. Evidently, these variables possess numerous extreme values. Based on their predictive capacity for the mortality rate, each feature has been segmented into 2–4 categories or "boxes".

    Figure 2.  The box plots of variables ANION, BUN, HR, INR, LACTATE, RR.

    The Ⅳ value of the final retained variables stands as follows: HR: 0.138, Tem: 0.139, age: 0.138, RR: 0.233, INR: 0.254, admission_type: 0.370, BUN: 0.380, anion: 0.476, and lactate: 0.380. For Ⅳ values of other variables, refer to Table 2. These explanatory variables demonstrate a remarkable predictive strength, and their Ⅳ values reflect the significance of these features.

    Table 2.  The Ⅳ value of variables.
    variables Ⅳ value
    Potassium 0.0114
    DBP 0.0124
    Sex 0.0146
    SBP 0.0217
    MAB 0.0224
    Platelet 0.028
    Gcs 0.0323
    Sodium 0.0451
    Spo2 0.0499
    WBC 0.0499
    Chloride 0.0559
    Glucose 0.0662
    Hematocrit 0.0681
    Hemoglobin 0.0799
    Age 0.1375
    HR 0.1375
    Tem 0.1387
    RR 0.2328
    INR 0.2538
    Creatinine 0.2944
    Admission_Type 0.3689
    Bun 0.3794
    Lactate 0.3804
    Anion 0.4756

     | Show Table
    DownLoad: CSV

    The boxing of variables is illustrated in Figure 3. For instance, in part (c), the variable BUN is partitioned into four ranges: less than 18, 18–24, 24–34, and greater than 34. Within the figure, red signifies deceased patients, while blue represents the survivors. On the bar chart, the notation (32.4%, 2564) indicates that 2564 patients had a blood urea nitrogen index value less than 18, constituting 32.4% of the total population. The 89.4% on the line chart signifies that among these patients, 89.4% survived. Likewise, 83.7% implies that out of the 1689 patients (comprising 21.4% of the total sample) with a BUN value in the range of [18,24], the survival rate was 83.7%.

    Figure 3.  Chi square box division results of nine input variables.

    To address the issue of the imbalanced proportions of positive and negative samples in datasets, the synthetic minority over-sampling technique (SMOTE) upsampling was employed. The dataset was randomly partitioned into two subsets at a 3 : 7 ratio, where the larger subset was designated for model training and evaluation, and the smaller one was reserved for the final model validation. All nine variables were incorporated into the model, and the evaluation metrics included the AUC, sensitivity, specificity, and accuracy. Cross-validation revealed that among all tested models, XGBoost exhibited the best performance, achieving an AUC of 0.91. Other models demonstrated a similar respectable performance, with AUCs of 0.81 for the logistic regression, 0.82 for the KNN, 0.82 for the decision tree, and 0.81 for the support vector machine. Figure 4 illustrates the receiver operating characteristic (ROC) curves of each model on the test set, highlighting that XGBoost remained the top performer in the test set, attaining an AUC of 0.91, whereas the AUCs of the other models ranged from 0.8 to 0.85.

    Figure 4.  Roc curves of each machine learning model.

    We have presented the accuracy, sensitivity, specificity, and AUC of each model on both the training and test sets in Table 3. The ROC curves of each machine learning model are presented in Figure 4. The accuracy provides a straightforward assessment of the model's ability to correctly predict the outcome for the patients. Among the commonly used evaluation indicators, the specificity and sensitivity hold particular significances. The sensitivity reflects the model's ability to identify patients who have passed away, while the specificity demonstrates its capacity to recognize those who have survived. Given the focus of our model on identifying patients at higher risk of mortality, the sensitivity became a crucial metric. Notably, XGBoost and the decision tree exhibited stronger sensitivities, with values of 0.84 and 0.78, respectively.

    Table 3.  Model evaluation results.
    Accuracy Sensitivity Specificity AUC
    Logistic train 0.71 0.73 0.71 0.80
    test 0.72 0.72 0.73 0.81
    XGBoost train 0.85 0.86 0.84 0.93
    test 0.83 0.84 0.82 0.91
    K-Neighbors train 0.78 0.81 0.75 0.87
    test 0.74 0.74 0.73 0.82
    Decision Tree train 0.74 0.79 0.69 0.82
    test 0.73 0.78 0.69 0.82
    Support Vector train 0.74 0.74 0.74 0.82
    test 0.74 0.72 0.75 0.81

     | Show Table
    DownLoad: CSV

    Currently, numerous scholars have developed models to predict the mortality risk among sepsis patients. In this section, we compare our model with several previously published models to demonstrate its efficacy. Table 4 summarizes the comparison outcomes.

    Table 4.  Comparision with other similar models.
    Model Variables number Sample size ML model AUC
    Model1 9 7919 XGBoost 0.91
    Model2 5 159 Logistic 0.82
    Model3 13 11101 LASSO 0.78
    Model4 20 1344 XGBoost 0.84
    Model5 20 9954 Random forest 0.86

     | Show Table
    DownLoad: CSV

    Model 1: Wan et al. [21] employed five medical examination indicators, including PCT and interleukin 6, and used a logistic regression to achieve an AUC of 0.82. However, their model relies on indicators that are only detectable when the disease progresses to a severe stage, thus limiting early prediction.

    Model 2: Zhang et al. [22] used MIMIC Ⅲ data to construct a Lasso model, and achieved an AUC of 0.78. Their approach of dividing variables into only two categories resulted in some information loss.

    Model 3: William's team [23] created an XGBoost model using comprehensive patient data and emergency room visits, and achieved an AUC of 0.84 on the validation set.

    Model 4: Taylor et al. [24] built a model that included the random forest, logistic regression, and the classification and regression tree (CART) models, utilizing patient data from four EDs. The random forest model performed the best, and yielded an AUC of 0.86.

    In contrast, our model outperforms these existing models due to our unique feature processing technique. By employing chi-square classification for the feature selection, we maximized the data utilization and facilitated the construction of a scoring card, thereby enhancing the predictive accuracy. Our XGBoost model achieved an AUC of 0.91, demonstrating its superiority.

    Based on the fundamental principles of a logistic regression, let us denote the probability of survival as p, and consequently, the probability of mortality as 1-p. According to the financial scorecard methodology, odds represents the ratio between the number of creditworthy (good) customers and uncreditworthy (bad) customers, where the creditworthy customers are assigned higher scores. When adapting this framework to assess the mortality risk scores in a medical context, we consider that the severity of a disease directly correlates with an increased mortality rate. Therefore, patients with more severe conditions, which indicate a higher mortality risk, should receive higher scores as an early warning indicator. In this analogy, a "good customer" corresponds to a patient whose final outcome is unfortunately death, while a "bad customer" represents a patient who survives. With this paradigm, we can derive the following:

    odds=p1p. (3)

    The probability of survival can be expressed as follows:

    p=odds1+odds. (4)

    Our score can be expressed as a linear expression of the logarithm of the ratio, and the score card scale can be set accordingly:

    Score=ABln(odds). (5)

    As we can see from the formula, the higher the probability of death, the higher the score. We need the following two assumptions to solve for A and B:

    (1) Set the expected score to be S0 when the odd is θ0; and

    (2) When the θ0 is doubled, the change in the score is PDO. By substituting the hypothesis into Eq (3), we can obtain the following:

    {S0=ABln(θ0)S0+PDO=ABln(2θ0). (6)

    The values A and B are obtained by solving the following formula:

    {A=S0+Bln(θ0)B=PDOln2. (7)

    Additionally, the logistic regression model is expressed as follows:

    ln(odds)=b+ω1x1+ω2x2++ωnxn. (8)

    The formula for the score can be rewritten as follows:

    Score=AB(b+ω1x1+ω2x2++ωnxn). (9)

    We set the parameters as S0 = 10 and PDO = 2. The final scoring card, which is presented in

    Table 5, facilitates the assessment of each patient's score, with higher scores indicating an elevated risk of mortality. In terms of the admission type, elective hospitalizations are associated with a lower mortality risk, hence the assigned score of –3, whereas emergency hospitalizations, due to their higher mortality risk, received a score of 2. Given that the mortality rate in the dataset is 20%, the corresponding odds are calculated to be 4. This allows us to determine the score values associated with different probabilities of death. Specifically, a score of 12 corresponds to a 33% risk of death, 14 points indicate a 50% risk, and 17 points signify a 75% risk of death. We categorize the scores into four risk levels: low risk ([2, 12)), medium-low risk ([12, 14)), medium-high risk ([14, 17)), and high risk ([17, 26)), with the latter representing the highest risk of death.

    Table 5.  Sepsis patient mortality risk scorecard.
    Characteristics Numerical range Score
    RR (b/min) < 20 0
    20–30 1
    > 30 3
    BUN (mg/dL) < 18 0
    18–24 1
    24–34 2
    > 34 3
    INR < 2 1
    > 2 3
    Age < 46 0
    46–76 1
    76–86 3
    > 86 4
    Anion (mEq/L) < 19 1
    19–24 2
    > 24 3
    Admission_type ELECTⅣE –3
    EMERGENCY,
    URGENT
    2
    Tem (C) < 36 4
    36–37 2
    > 37 1
    HR (b/min) < 115 1
    > 115 2
    Lactate (mmol/L) < 4 1
    4–6 2
    6–9 3
    > 9 5

     | Show Table
    DownLoad: CSV

    Using this scoring system, we analyzed 7919 sepsis patients from the dataset, as illustrated in Figure. Our findings reveal a clear trend: patients who eventually succumbed to the illness tended to have significantly higher scores, with only 12.2% falling into the low-risk category. Conversely, patients who survived had predominantly lower scores, with just 7.8% classified as high-risk. The numbers presented in the table further demonstrate the effectiveness of this scoring system to distinguish between patients based on their risk levels. For instance, 3128 represents the number of surviving patients with low-risk scores, accounting for 50% of all survivors, whereas 203 denotes the number of deceased patients with low-risk scores, constituting 12.2% of all fatalities. These results closely align with the actual outcomes, underscoring the model's effectiveness.

    To further validate the robustness of our model, we not only benchmarked it against other predictive models, but also conducted a comparative analysis with existing medical scoring systems. Our findings reveal that our model outperforms the currently available scoring systems. For illustrative purposes, the AUC curve of our scorecard model is exhibited in Figure 6. As evident from Figure 7, the existing medical scores exhibit varying degrees of sensitivity and specificity. Among them, LODS stands out with an AUC of 0.78, along with a high sensitivity of 0.86. However, its specificity is relatively low at 0.48. Notably, the AUC of our scorecard model, 0.79, surpasses the highest LODS score, 0.78, among the other medical assessments. Derived from a logistic regression, LODS proves to be a useful tool, but it falls short of being the ultimate predictor of mortality risk in the sepsis patients. The OASIS model demonstrates a more balanced approach with an AUC of 0.75, a sensitivity of 0.76, and a specificity of 0.64. While its performance surpasses other medical scores, it slightly lags behind the logistic regression models. It's worth noting that the specificity of the SOFA scoring model might be skewed due to its limitation in patient screening (SOFA >= 2). On the other hand, the QSOFA score, primarily designed for use outside the ICU, may not provide accurate predictions for patients within the ICU setting. Both the OASIS and LODS scores assess the disease severity, though neither is tailored specifically to sepsis. Given these limitations and the unique challenges posed by sepsis, the development of this mortality risk scoring system specifically designed for sepsis patients in this paper holds a strong practical significance.

    Figure 5.  Differences in scores between patients who died and those who survived.
    Figure 6.  ROC curve of the scorecard model scoring models.
    Figure 7.  The results compared with other medical scoring systems.

    Our study showcased the utilization of machine learning models to predict 31-day mortality among sepsis patients admitted to the ICU, with several significant findings and implications.

    1)  Precision of Machine Learning Models: We established the precision of machine learning-driven models in forecasting 31-day mortality among sepsis patients. Our study demonstrated that the XGBoost model performed exceptionally well, with an AUC of 0.91, and outperformed other traditional machine learning algorithms and clinical risk scores.

    2)  Development of Mortality Risk Scoring System: By harnessing various methodologies, including a logistic regression, we formulated a mortality risk scoring system specifically tailored for sepsis patients. This scoring system integrated only nine straightforward variables routinely documented in clinical environments, and eliminated the need for manual data input [19].

    3)  Comparison with Existing Clinical Risk Scores: Our scoring system outperformed several widely-used clinical risk scores, such as qSOFA, LODS, OASIS, SIRS, and SOFA [25,26,27,28,29].

    4)  Applicability and Limitations: Although our model demonstrated a commendable performance, several limitations exist. First, as with any retrospective analysis, the included data and its precision hinge on the original ICU records, making it prone to potential biases. Second, our focus was on sepsis patients admitted to the ICU, thus limiting the generalizability of our findings to other patient populations. Lastly, the conversion of continuous variables into discrete ones through variable binning might have slightly compromised the prediction accuracy.

    Future Directions: Prospective studies are needed to assess the efficacy of our approach and its potential to enhance the clinical outcomes in patients at a high risk of sepsis. Furthermore, external validations of our model in different patient populations are needed to confirm its generalizability.

    This study retrospectively analyzed the MIMIC Ⅲ database, and utilized sepsis 3.0 diagnostic criteria alongside the first-day laboratory indicators and clinical data of patients. Through a machine learning technology, we constructed a mortality risk scoring card specifically tailored for sepsis patients. This scoring system outperformed several existing medical scores. By leveraging easily accessible indicators, this paper enabled the rapid determination of death risk scores shortly after patients are admitted to the ICU. This timely assessment aids physicians in quickly understanding a patient's physical condition, potentially leading to an improved prognosis.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    This work was supported by the National Natural Science Foundation of China under the Grant Nos. 11702250 and the Key Scientific Research Projects of Colleges and Universities in Henan Province under Grant No.22A520010.

    The authors declare that they have no conflict of interest with respect to the research, authorship, and publication of the article.



    [1] L. Kong, Q. Wang, Y. Bao, H. Li, A survey on medical image segmentation based on deep leaning, Radio Commun. Technol., 47 (2021), 121–130. https://doi.org/10.3969/j.issn.1003-3114.2021.02.001 doi: 10.3969/j.issn.1003-3114.2021.02.001
    [2] J. Chen, L. Li, Automatic segmentation of fuzzy areas in ultrasonic images based on edge detection, Autom. Instrument., 11 (2021), 19–22. https://doi.org/10.14016/j.cnki.1001-9227.2021.11.019 doi: 10.14016/j.cnki.1001-9227.2021.11.019
    [3] A. Aslam, E. Khan, M. M. S. Beg, Improved edge detection algorithm for brain tumor segmentation, Proced. Computer Sci., 58 (2015), 430–437. https://doi.org/10.1016/j.procs.2015.08.057 doi: 10.1016/j.procs.2015.08.057
    [4] M. Van Eijnatten, R. van Dijk, J. Dobbe, G. Streekstra, J. Koivisto, J. Wolff, CT image segmentation methods for bone used in medical additive manufacturing, Med. Eng. Phys., 51 (2018), 6–16. https://doi.org/10.1016/j.medengphy.2017.10.008 doi: 10.1016/j.medengphy.2017.10.008
    [5] J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2015), 3431–3440. https://doi.org/10.1109/CVPR.2015.7298965
    [6] O. Ronneberger, P. Fischer, T. Brox, U-net: Convolutional networks for biomedical image segmentation, in International Conference on Medical image computing and computer assisted intervention, (2015), 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
    [7] O. Oktay, J. Schlemper, L. L. Folgoc, M. Lee, M. Heinrich, K. Misawa, et al., Attention u-net: Learning where to look for the pancreas, arXiv preprint, (2018), arXiv: 1804.03999. https://doi.org/10.48550/arXiv.1804.03999
    [8] Z. Zhou, M. M. R. Siddiquee, N. Tajbakhsh, J. Liang, Unet++: A nested u-net architecture for medical image segmentation, Deep learning in medical image analysis and multimodal learning for clinical decision support, (2018) 3–11. https://doi.org/10.1007/978-3-030-00889-5_1 doi: 10.1007/978-3-030-00889-5_1
    [9] M. Z. Alom, M. Hasan, C. Yakopcic, T. M. Taha, V. K. Asari, Recurrent residual convolutional neural network based on u-net (r2u-net) for medical image segmentation, arXiv preprint, (2018), arXiv: 1802.06955. https://doi.org/10.48550/arXiv.1802.06955
    [10] G. Huang, S. Liu, L. Van der Maaten, K. Q. Weinberger, Condensenet: An efficient densenet using learned group convolutions, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2018), 2752–2761. https://doi.org/10.1109/CVPR.2018.00291
    [11] F. Zhu, Z. Gao, C. Zhao, Z. Zhu, J. Tang, Y. Liu, et al., Semantic segmentation using deep learning to extract total extraocular muscles and optic nerve from orbital computed tomography images, Optik, 244 (2021), 167551. https://doi.org/10.1016/j.ijleo.2021.167551 doi: 10.1016/j.ijleo.2021.167551
    [12] H. Li, J. Fan, Q. Hua, X. Li, Z. Wen, M. Yang, Biomedical sensor image segmentation algorithm based on improved fully convolutional network, Measurement, 197 (2022), 111307. https://doi.org/10.1016/j.measurement.2022.111307 doi: 10.1016/j.measurement.2022.111307
    [13] X. Liu, D. Zhang, J. Yao, J. Tang, Transformer and convolutional based dual branch network for retinal vessel segmentation in OCTA images, Biomed. Signal Process. Control, 83 (2023). https://doi.org/10.1016/j.bspc.2023.104604 doi: 10.1016/j.bspc.2023.104604
    [14] N. Mu, Z. Lyu, M. Rezaeitaleshmahalleh, J. Tang, J. Jiang, An attention residual u-net with differential preprocessing and geometric postprocessing: Learning how to segment vasculature including intracranial aneurysms, Med. Image Anal., 84 (2023), 102697. https://doi.org/10.1016/j.media.2022.102697 doi: 10.1016/j.media.2022.102697
    [15] J. Qin, J. Wu, P. Yan, M. Li, R. Yuxi, X. Xiao, et al., FreeSeg: unified, universal and open-vocabulary image segmentation, inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2023), 19446–19455. https://doi.org/10.1109/CVPR52729.2023.01863
    [16] T. Ma*, H. Zhao, X. Qin, A dehazing method for flight view images based on transformer and physical priori, Math. Biosci. Eng., 20 (2023), 20727–20747. http://dx.doi.org/10.3934/mbe.2023917 doi: 10.3934/mbe.2023917
    [17] A. Kirillov, E. Mintun, N. Ravi, H. Mao, C. Rolland, L. Gustafson, et al., Segment anything, arXiv preprint, (2023), arXiv: 2304.02643. https://doi.org/10.48550/arXiv.2304.02643
    [18] X. Zhao, W. Ding, Y. An, Y. Du, T. Yu, M. Li, et al., Fast Segment Anything, arXiv preprint, (2023), arXiv: 2306.12156. https://doi.org/10.48550/arXiv.2306.12156
    [19] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., Going deeper with convolutions, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2015), 1–9. https://doi.org/10.1109/CVPR.2015.7298594
    [20] S. Ioffe, C. Szegedy, Batch normalization: accelerating deep network training by reducing internal covariate shift, in International conference on machine learning, (2015), 448–456. https://doi.org/10.48550/arXiv.1502.03167
    [21] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, Z. Wojna, Rethinking the inception architecture for computer vision, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2016), 2818–2826. https://doi.org/10.1109/CVPR.2016.308
    [22] C. Szegedy, S. Ioffe, V. Vanhoucke, A. Alemi, Inception-v4, inception-resnet and the impact of residual connections on learning, in Proceedings of the AAAI conference on artificial intelligence, 31 (2017). https://doi.org/10.48550/arXiv.1602.07261
    [23] T. Ma, C. Fu, J. Yang, J. Zhang, C. Shang, RF-Net: unsupervised low-light image enhancement based on retinex and exposure fusion, Comput. Mater. Continua, 77 (2023), 1103–1122. https://doi.org/10.32604/cmc.2023.042416 doi: 10.32604/cmc.2023.042416
    [24] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., Going deeper with convolutions, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2015), 1–9. https://doi.org/10.1109/CVPR.2015.7298594
    [25] Y. Wei, H. Xiao, H. Shi, Z. Jie, J. Feng, T. S. Huang, Revisiting dilated convolution: A simple approach for weakly and semi supervised semantic segmentation, in Proceedings of the IEEE conference on computer vision and pattern recognition, (2018), 7268–7277. https://doi.org/10.1109/CVPR.2018.00759
    [26] D. M. Vo, S. W. Lee, Semantic image segmentation using fully convolutional neural networks with multi-scale images and multi-scale dilated convolutions, Multimedia Tools Appl., 77 (2018), 18689–18707. https://doi.org/10.1007/s11042-018-5653-x doi: 10.1007/s11042-018-5653-x
    [27] H. Li, Q. Zheng, W. Yan, R. Tao, X. Qi, Z. Wen, Image super-resolution reconstruction for secure data transmission in Internet of Things environment, Math. Biosci. Eng., 18 (2021), 6652–6672. https://doi.org/10.3934/mbe.2021330 doi: 10.3934/mbe.2021330
    [28] H. Cheng, J. Lu, M. Luo, W. Liu, K. Zhang, PTANet: Triple attention network for point cloud semantic segmentation, Eng. Appl. Artif. Intell., 102 (2021), 104239. https://doi.org/10.1016/j.engappai.2021.104239 doi: 10.1016/j.engappai.2021.104239
    [29] Z. Gu, J. Cheng, H. Fu, K. Zhou, H. Hao, Y. Zhao, et al., Ce-net: Context encoder network for 2d medical image segmentation, IEEE Transact. Med. Imag., 38 (2019), 2281–2292. https://doi.org/10.1109/TMI.2019.2903562 doi: 10.1109/TMI.2019.2903562
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1953) PDF downloads(113) Cited by(6)

Figures and Tables

Figures(9)  /  Tables(3)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog