Loading [MathJax]/jax/output/SVG/jax.js
Review

Ways to improve biocides for metalworking fluid

  • Metalworking fluids (MWF) are mainly emulsions of oil in water containing additives such as corrosion inhibitors, emulsifiers, defoamers, and biocides. Microbial contamination of MWF is almost systematic, and some of their constituents serve as nutrients for contaminating microorganisms. Biocides for MWF are protection products used to counter microbial contaminations and growth. Ideally, a biocide for MWF should have the following non-exhaustive criteria: have a broad-spectrum activity, be usable at low concentrations, be compatible with the formulation and the physical-chemical properties of MWF, be stable over time, retain its effectiveness in the presence of soiling, have no corrosive action on metals, present no danger to humans and the environment, be inexpensive. The future lies in the development of new molecules with biocidal activity corresponding to these ideal specifications, but in the meantime, it is possible to improve the performance of existing molecules currently on the market. Different strategies for potentiation of the activity of existing biocides are possible. The compatibility of the potentiation strategies with their use in metal working fluids is discussed.

    Citation: Patrick Di Martino. Ways to improve biocides for metalworking fluid[J]. AIMS Microbiology, 2021, 7(1): 13-27. doi: 10.3934/microbiol.2021002

    Related Papers:

    [1] Shipeng Li . Impulsive control for stationary oscillation of nonlinear delay systems and applications. Mathematical Modelling and Control, 2023, 3(4): 267-277. doi: 10.3934/mmc.2023023
    [2] Yanchao He, Yuzhen Bai . Finite-time stability and applications of positive switched linear delayed impulsive systems. Mathematical Modelling and Control, 2024, 4(2): 178-194. doi: 10.3934/mmc.2024016
    [3] Gani Stamov, Ekaterina Gospodinova, Ivanka Stamova . Practical exponential stability with respect to hmanifolds of discontinuous delayed Cohen–Grossberg neural networks with variable impulsive perturbations. Mathematical Modelling and Control, 2021, 1(1): 26-34. doi: 10.3934/mmc.2021003
    [4] Yao Chu, Xiuping Han, R. Rakkiyappan . Finite-time lag synchronization for two-layer complex networks with impulsive effects. Mathematical Modelling and Control, 2024, 4(1): 71-85. doi: 10.3934/mmc.2024007
    [5] Hongyu Ma, Dadong Tian, Mei Li, Chao Zhang . Reachable set estimation for 2-D switched nonlinear positive systems with impulsive effects and bounded disturbances described by the Roesser model. Mathematical Modelling and Control, 2024, 4(2): 152-162. doi: 10.3934/mmc.2024014
    [6] Qin Xu, Xiao Wang, Yicheng Liu . Emergent behavior of Cucker–Smale model with time-varying topological structures and reaction-type delays. Mathematical Modelling and Control, 2022, 2(4): 200-218. doi: 10.3934/mmc.2022020
    [7] Sheng Wang, Baoli Lei . Dynamics of a stochastic hybrid delay one-predator-two-prey model with harvesting and jumps in a polluted environment. Mathematical Modelling and Control, 2025, 5(1): 85-102. doi: 10.3934/mmc.2025007
    [8] Hongwei Zheng, Yujuan Tian . Exponential stability of time-delay systems with highly nonlinear impulses involving delays. Mathematical Modelling and Control, 2025, 5(1): 103-120. doi: 10.3934/mmc.2025008
    [9] Bangxin Jiang, Yijun Lou, Jianquan Lu . Input-to-state stability of delayed systems with bounded-delay impulses. Mathematical Modelling and Control, 2022, 2(2): 44-54. doi: 10.3934/mmc.2022006
    [10] Zeyan Yue, Lijuan Dong, Sheng Wang . Stochastic persistence and global attractivity of a two-predator one-prey system with S-type distributed time delays. Mathematical Modelling and Control, 2022, 2(4): 272-281. doi: 10.3934/mmc.2022026
  • Metalworking fluids (MWF) are mainly emulsions of oil in water containing additives such as corrosion inhibitors, emulsifiers, defoamers, and biocides. Microbial contamination of MWF is almost systematic, and some of their constituents serve as nutrients for contaminating microorganisms. Biocides for MWF are protection products used to counter microbial contaminations and growth. Ideally, a biocide for MWF should have the following non-exhaustive criteria: have a broad-spectrum activity, be usable at low concentrations, be compatible with the formulation and the physical-chemical properties of MWF, be stable over time, retain its effectiveness in the presence of soiling, have no corrosive action on metals, present no danger to humans and the environment, be inexpensive. The future lies in the development of new molecules with biocidal activity corresponding to these ideal specifications, but in the meantime, it is possible to improve the performance of existing molecules currently on the market. Different strategies for potentiation of the activity of existing biocides are possible. The compatibility of the potentiation strategies with their use in metal working fluids is discussed.


    In the financial market, risk management has become one of the key elements of banking. Credit card customer churn prediction plays an important role in the risk management of the banking industry, assisting banks in identifying potential risks and implementing suitable measures to mitigate them (Butaru et al., 2016). With the development of technology, the influence of customer churn prediction has extended into various fields, including the financial industry. In particular, the rapid progression of machine learning theories and algorithms has offered a fresh perspective and solution for customer churn prediction (Alfaiz and Fati, 2022). Firstly, machine learning technology is capable of automatically extracting useful information from large amounts of historical data, identifying distribution patterns and relationships to predict future trends and behaviors (Pudjihartono et al., 2022). In credit card customer churn prediction, machine learning methods can efficiently identify key variables and underlying relationships related to customer attrition. By analyzing the mass multi-dimensional data, including consumer behavior, credit history, and demographic information of historical users, machine learning can construct an effective prediction model. Secondly, machine learning technology has strong generalization ability, automatically adapting to new and unseen data after training without frequent adjustments and optimizations (Janiesch et al., 2021). Bank customer churn prediction using machine learning methods presents both challenges and opportunities. Through comprehensive studies and the application of machine learning methods, prediction models provide more precise and reliable results, thereby enhancing the accuracy and reliability of risk management standards in the credit card business and supporting the sustainable development of banking.

    Despite the significant advantages of machine learning in customer churn prediction, its practical implementation still encounters several challenges. As datasets grow larger and models become more complex, ensuring model interpretability becomes a crucial concern (Chen and Meng, 2020). The SHAP values method is applied to interpret the outcomes of machine learning models. It quantifies the importance of each variable in the dataset, considering both the independent influence of each variable on prediction outcomes and the interactions among different variables. The SHAP values method provides a comprehensive explanation of machine learning model predictions (Gebreyesus et al., 2023). Moreover, the causal inference method has gained significant popularity in recent years for data analysis and machine learning-related research. By analyzing causal effects among variables, the causal inference method excludes variables that are related to prediction results but lack actual causal connections, and filters out variables with a causal impact on prediction results. This method provides a more reliable basis for variable selection in predictions based on machine learning (Li et al., 2023). The purpose of this study is to discuss the construction of credit card customer churn prediction models by applying machine learning techniques. Additionally, the SHAP values method is utilized to complete variable selection based on the importance of influencing the outcomes and to interpret machine learning-based prediction results. The causal inference method aims to explain the prediction results by analyzing the causal relationships between variables. These causal relationships are compared with the results obtained by the SHAP values method.

    The remainder of this study is organized as follows. Section 2 reviews research related to machine learning, interpretability analysis, and causal inference. Section 3 describes the concepts of four sampling techniques for data preprocessing, six machine learning methods for credit card customer churn prediction, and the causal inference method in detail. Section 4 summarizes the performance of the machine learning models, interprets the optimal model, and identifies the causal relationships by analyzing the experimental results. Finally, Section 5 presents the conclusion and describes limitations and further research.

    With the development of computer hardware in recent years, technologies such as machine learning and deep learning are now widely used across various industries. Machine learning can be combined with stock price prediction to construct new quantitative investment strategies in financial investment (Yan et al., 2023). In the field of stock market risk, traditional machine learning models and neural networks can also predict Barrier Option prices with relative accuracy (Li and Yan, 2023). Even Bitcoin, a cryptocurrency that has been widely discussed in recent years, can have its price predicted by machine learning methods, with the Support Vector Regression model outperforming other models (Erfanian et al., 2022). In the area of credit card customer churn prediction, there is a growing number of reliable research papers aiming to predict churn rates by analyzing the personal and behavioral data of credit card customers, enabling banks to take effective measures to retain customers in advance. For instance, Zhang et al. used simple preprocessing of customer data and applied only the Random Forest (RF) model, a traditional machine learning method, to classify customers, achieving better classification results (Zhang et al., 2024). In the study by de Lima Lemos et al., several traditional machine learning models were used for prediction, including RF, Decision Tree, k-Nearest Neighbor, Elastic Net, Logistic Regression, and Support Vector Machines, with the RF model achieving the best classification accuracy at 82.8% (de Lima Lemos et al., 2022). Lalwani et al. introduced more advanced ideas into the data preprocessing problem, using the Gravitational Search Algorithm to select variables and improve the efficiency of the machine learning models (Lalwani et al., 2022). Siddiqui et al. used three variable selection methods to observe the performance of different machine learning models by comparing all variables, separating continuous and discrete variables, and selecting variables based on their importance. They ultimately found that using all variables yielded the most prevalent classification results (Siddiqui et al., 2024).

    In addition to focusing on constructing different models to improve classification efficacy, researchers have gradually paid more attention to the interpretability of the models and the relationships between variables. In the field of financial investment, some articles focus on the impact of each variable on volatility risk (Yan and Li, 2024). In the study by AL-Najjar et al., they demonstrated the variable importance of the same traditional machine learning model, C5 Tree, under different methods of variable selection. The results showed that the three main variables affecting the model's effectiveness were the total number of transactions, the total credit card revolving balance, and the change in the number of transactions, providing useful references for credit card managers (AL-Najjar et al., 2022). Other studies discuss variables using more advanced and understandable variable visualization tools, such as the SHAP values method. In Peng et al.'s study, the best model for classification, which combined a genetic algorithm with XGBoost, was used to analyze interpretability. The results of the SHAP values method not only showed the order of importance of different independent variables, but also indicated whether changes in the value of each independent variable had a positive or negative effect on the predicted variable (Peng et al., 2023). In addition to the SHAP values method, other interpretable tools for visualization with similar effects, such as Local Interpretable Model-Agnostic Explanations (LIME), can also be used for analysis (Chang et al., 2024).

    So far, machine learning algorithms have primarily captured correlations between variables, and inferring their importance from a correlation perspective, often ignoring the causal relationships among them. In other words, the model's determination of variable importance is frequently based on the strength of the correlation between the dependent and independent variables, which can introduce bias errors in actual predictions (Feuerriegel et al., 2024). To address this issue, this study innovatively uses the R-learner in meta learners to analyze variable importance. This approach combines the fields of machine learning and causal inference (Künzel et al. , 2019). The R-learner employs the control variable method to observe how independent variables affect the probability of credit card customer churn and calculates a more accurate conditional average treatment effect (CATE) by removing bias from the data. This helps quantify the effect of different variables on the probability of credit card customer churn.

    The aim of this study is to predict whether bank credit card customers will churn using machine learning methods and to utilize the prediction results to construct models that explain the influencing factors of customer churn. This will provide an effective basis for rationalizing customer management before they leave the bank and for effective risk management in the banking industry. Initially, the relevant dataset of bank customer churn needs to be downloaded and preprocessed. Subsequently, the approach is divided into two parts. The first part involves balancing the class distribution of the training set using four sampling techniques: Random Oversampling, Synthetic Minority Oversampling Technique (SMOTE), Borderline-Synthetic Minority Oversampling Technique (Borderline-SMOTE), and Adaptive Synthetic Sampling (ADASYN). Then, six machine learning methods, including RF, Gradient Boosting Decision Tree (GBDT), Extra Tree, AdaBoost, XGBoost, and CatBoost, are used to predict whether the bank will lose credit card customers. The optimal model is selected by comparing the performance of each model, and the important variables influencing customer churn and their effects are analyzed using the SHAP values method. The second part involves using the causal inference method to investigate the causal impact of variables on customer churn based on the optimal prediction model mentioned above. In this part, the R-learner is used for continuous variables in the meta learners of the causal inference method. The framework of this study is outlined in Figure 1.

    Figure 1.  The research framework diagram.
    Table 1.  The summary of categorical variables.
    Categorical Variables Class Number of classes Conversion to numbers
    Attrition_Flag Existing Customer 8500 0
    Attrited Customer 1627 1
    Gender M 4769 0
    F 5358 1
    Education_Level Uneducated 1487 6
    High School 2013 15
    College 1013 18
    Graduate 3128 22
    Post-Graduate 516 24
    Doctorate 451 28
    Unknown 1519 22
    Marital_Status Single 3943 0
    Married 4687 1
    Divorced 748 2
    Unknown 749 1
    Income_Category Less than $40K 3561 2
    $40K - $60K 1790 5
    $60K - $80K 1402 7
    $80K - $120K 1535 10
    $120K + 727 14
    Unknown 1112 2
    Card_Category Blue 9436 0
    Silver 555 1
    Gold 116 2
    Platinum 20 3

     | Show Table
    DownLoad: CSV

    Historical data of bank customers downloaded from Kaggle is used as the dataset in this study (Kaggle, 2022). The dataset contains twenty three variables, including customers' fundamental personal details, the credit card level they hold, and their credit card usage, totaling 10,127 samples. The variable labeled "CLIENTNUM" represents the unique identification number assigned by the bank to each customer. Since this variable has no effect on the model construction in this study, it is removed from the analysis. Additionally, the last two variables in the dataset represent the outcomes provided by Kaggle for predicting bank customer churn based on Naive Bayes classifiers. These variables have been excluded during the data processing as this study does not compare this probabilistic classification approach with machine learning methods. The remaining twenty variables, which consist of fourteen quantitative variables and six categorical variables, are all utilized in the experiment in this study to discuss the importance and impact. These variables represent customers' personal information and credit card usage. To build the model, it is necessary to convert the classes of categorical variables into numerical values. These six categorical variables are: "Attrition_Flag", "Gender", "Education_Level", "Marital_Status", "Income_Category", and "Card_Category". For ordered categorical variables, the classes need to be sorted from low to high before conversion. One type of class encountered in the variables "Education_Level", "Marital_Status", and "Income_Category" is "Unknown". There are 3,046 samples that meet the criterion of having at least one of these three variables labeled as "Unknown", constituting 30% of the total sample size. The treatment for "Unknown" in any of these three variables is to change it to the majority class of the respective variable. The details of these six categorical variables are outlined in Table 2.

    Table 2.  The evaluation results of prediction models in the test set.
    Sampling techniques Machine learning accuracy precision recall F1 AUC
    Random Oversampling RF 0.9650 0.9648 0.9650 0.9449 0.9901
    GBDT 0.9743 0.9746 0.9743 0.9745 0.9954
    Extra Tree 0.9620 0.9613 0.9620 0.9613 0.9890
    AdaBoost 0.9521 0.9568 0.9521 0.9535 0.9893
    XGBoost 0.9748 0.9751 0.9748 0.9749 0.9939
    CatBoost 0.9556 0.9606 0.9556 0.9570 0.9900
    SMOTE RF 0.9615 0.9627 0.9615 0.9620 0.9897
    GBDT 0.9699 0.9701 0.9699 0.9700 0.9945
    Extra Tree 0.9600 0.9596 0.9600 0.9597 0.9881
    AdaBoost 0.9556 0.9565 0.9556 0.9560 0.9856
    XGBoost 0.9704 0.9704 0.9704 0.9704 0.9919
    CatBoost 0.9590 0.9607 0.9590 0.9596 0.9884
    Borderline-SMOTE RF 0.9585 0.9583 0.9585 0.9588 0.9891
    GBDT 0.9709 0.9711 0.9709 0.9710 0.9936
    Extra Tree 0.9556 0.9549 0.9556 0.9551 0.9886
    AdaBoost 0.9516 0.9538 0.9516 0.9524 0.9855
    XGBoost 0.9719 0.9719 0.9719 0.9719 0.9934
    CatBoost 0.9497 0.9532 0.9497 0.9508 0.9857
    ADASYN RF 0.9605 0.9615 0.9605 0.9609 0.9901
    GBDT 0.9709 0.9710 0.9709 0.9709 0.9939
    Extra Tree 0.9566 0.9561 0.9566 0.9563 0.9885
    AdaBoost 0.9576 0.9589 0.9876 0.9581 0.9859
    XGBoost 0.9724 0.9724 0.9724 0.9724 0.9931
    CatBoost 0.9546 0.9575 0.9546 0.9556 0.9878

     | Show Table
    DownLoad: CSV

    The variable "Marital_Status" is an unordered categorical variable with more than two distinct values, requiring conversion into dummy variables.

    The variable "Attrition_Flag" indicates whether a customer has churned or not, serving as the target variable of this study. The dataset contains 1,627 samples in the "Attrited Customer" class. The ratio of these samples to those in the "Existing Customer" class is greater than 5:1, indicating a significant class imbalance, as shown in Figure 2.

    Figure 2.  The proportion of classes for "Attrition_Flag".

    If the training set is used by models without preprocessing, it will decrease the ability of the machine learning models to recognize churned customers, leading to overfitting in the results and reducing prediction accuracy (Alam et al., 2020). To avoid these problems and improve the performance of the model, improved oversampling has been adopted to balance the class distribution, which involves increasing the number of samples from the minority class. In this study, the following four sampling algorithms are used.

    Random oversampling is a straightforward method that randomly selects and replicates samples from the minority class until the classes are balanced.

    SMOTE is an improved algorithm for handling imbalanced data based on random oversampling. The first step involves calculating the distance between each sample from the minority class and identifying the k nearest neighbors of the same class. Then, a specified number of samples are randomly selected from these nearest neighbors. A new synthetic sample is generated between the minority class sample and the selected nearest neighbor, lying on the line connecting these two samples, thereby increasing the number of minority class samples (Akın, 2023).

    Borderline-SMOTE is an improved version of the SMOTE algorithm. The key difference is an additional preliminary step: a minority class sample is selected to apply the SMOTE algorithm if most of its k nearest neighbors are from the majority class (Gu et al., 2023).

    ADASYN is another common oversampling method similar to SMOTE. The key to this algorithm is calculating the ratio of samples between different classes for each minority class sample and using this ratio distribution to determine the number of synthetic samples generated for each minority class sample (Dube and Verster, 2023).

    This study employs six machine learning methods to predict credit card customer churn. In addition to RF and AdaBoost, which were used in the previous research to predict barrier option prices (Li and Yan, 2023), this study utilizes the extra tree algorithm and three other prevalent boosting algorithms: GBDT, XGBoost, and CatBoost. To enhance the performance of these machine learning models, the grid search method is used to tune the hyperparameters of each model and find the optimal set. A complete customer churn prediction model can be constructed by combining one of the sampling techniques mentioned above with one of these six machine learning methods. The prediction will result in four possible cases:

    ● True positive (TP): The predicted result is positive, and the actual value is also positive.

    ● True negative (TN): The predicted result is negative, and the actual value is also negative.

    ● False positive (FP): The predicted result is positive, whereas the actual value is negative.

    ● False negative (FN): The predicted result is negative, whereas the actual value is positive.

    TP and TN indicate the predicted results are consistent with the actual values, and FP and FN indicate the opposite. To evaluate the performance of models in binary classification problems, the following indicators are typically utilized: accuracy, precision, recall, F1 score, and AUC. The formulas for the first four indicators are defined as follows:

    accuracy=TP+TNTP+TN+FP+FN, (1)
    precision=TPTP+FP, (2)
    recall=TPTP+FN, (3)
    F1=2×precision×recallprecision+recall. (4)

    The value of AUC is equal to the area under the receiver operating characteristic curve (ROC), which is the curve consisting of recall and false positive rate (FPR). The formula for FPR is as follows:

    FPR=FPFP+TN. (5)

    The closer the values of these five indicators are to 1, the better the model performs.

    Causal inference is a method that effectively explores and analyzes whether a variable is the main factor affecting the target variable ((Jiang, 2022). In traditional causal inference, the randomized controlled trial is generally considered as a reliable methodology to determine the influence of variables. However, in practice, due to the cost and ethical concerns of experiment, causal inference is often based on collected observational datasets. In Facure's book, which explores the combination of machine learning and causal inference, the authors propose a modeling approach called R-learner for continuous variables in a dataset to assess the causal significance of an independent variable on a dependent variable (Facure M, 2023). In current industrial applications, various code tools are becoming available for researchers to explore the feasibility of machine learning in causal inference problems (Molak A, 2023). To determine whether a variable has a causal relationship with customer churn and to quantify its impact, this study adopts the causal inference method to understand the causal effects among them.

    In causality, the cause variable is the treatment denoted as T, while the dependent variable is the outcome denoted as Y. Additionally, the other independent variables are referred to as characteristic variables, denoted as X. For an individual sample i, the prediction model predicts the corresponding value Yi as Yi(T=1|X) when the treatment is applied (i.e., Ti=1), and as Yi(T=0|X) without the treatment (i.e., Ti=0). Individual treatment effect (ITE) calculates the difference in outcomes for an individual between the scenario with treatment and the scenario without treatment. The formula of ITE is shown as follows:

    ITEi=Yi(T=1|X)Yi(T=0|X). (6)

    Considering the overall casual effect between the treatment and the outcome, it needs to measure CATE, which is given by the following equation:

    CATE=E[Y|T=1,X]E[Y|T=0,X]. (7)

    The value of CATE indicates whether there exists a causal relationship between the treatment variable T on the outcome Y, and the quantification of its effect.

    For binary classification problems, this study uses R-learner to estimate the causality and effect of different continuous treatment variables on the outcome Y respectively, based on the optimal customer churn prediction model. The CATE value in R-learner is obtained by calculating the probability of the same outcome occurring with and without the treatment, which is given by the following equation:

    CATE=P(Y=1|T=1,X)P(Y=1|T=0,X). (8)

    For each T, hypothesis testing is applied to CATE to determine the casual impact of that treatment variable on Y.

    The experimental environment for this study is Python, utilizing toolboxes such as imblearn, sklearn, xgboost, and catboost. Following the data preprocessing steps outlined in Section 3.1, the dataset is divided into a training set and a test set in an 8:2 ratio. The training set data is balanced using four sampling techniques, and the independent variables of all the data are normalized. Subsequently, six machine learning methods are trained on the training set data, resulting in twenty-four complete customer churn prediction models. These trained models are then applied to predict the "Attrition_Flag" values in the test set, and the performance of these models is evaluated using the indicators mentioned in Section 3.3.

    The results of the accuracy for the training set, evaluated using six machine learning methods, range from 96.06% to 99.92% under random oversampling, 97.33% to 99.88% under SMOTE, 96.95% to 99.94% under Borderline-SMOTE, and 96.88% to 99.91% under ADASYN. The test set results are as follows.

    As shown in Table 2, the values of all indicators for the complete customer churn prediction models are above 0.94 after hyperparameter tuning and training, indicating good performance for each model. The difference in the evaluation results of the prediction models between the training set and the test set is not significant, and there is no instance of complete overfitting. For the random oversampling-XGBoost, SMOTE-XGBoost, Borderline-SMOTE-XGBoost, and ADASYN-XGBoost models, all indicator values exceed 0.97, suggesting that these four models perform better than the others. These four models can be considered the optimal prediction models. Furthermore, the XGBoost model performs better and more stably compared to other machine learning models across different sampling techniques by optimizing both the loss function and regularization term (Guo and Fan, 2024).

    The SHAP values method is used to understand the contribution of each variable in the optimal prediction models and its relationship with the target variable better (Wu et al., 2024). When the SHAP value of a variable is positive, it indicates a positive relationship with customer churn, whereas a negative SHAP value indicates a negative relationship. By using the SHAP toolbox in Python, an interpretable explanation of the optimal prediction models is obtained, resulting in corresponding SHAP summary plots. The variables in the SHAP summary plots are sorted from highest to lowest according to their importance in each model, with the top 10 variables selected. The points corresponding to each variable represent their SHAP values.

    Figure 3.  The SHAP summary plot of random oversampling-XGBoost model.
    Figure 4.  The SHAP summary plot of SMOTE-XGBoost model.
    Figure 5.  The SHAP summary plot of BorderlineSMOTE-XGBoost model.
    Figure 6.  The SHAP summary plot of ADASYN-XGBoost model.

    Based on the four figures above, in any of the optimal prediction models, the variables "Total_Trans_Ct", "Total_Trans_Amt", and "Total_Revolving_Bal" consistently rank in the top three in terms of importance and in the same order. Moreover, the positive or negative relationship between each of these variables and "Attrition_Flag" remains unchanged. These three variables represent the customer's total number of transactions with the bank in the past year, the total amount of transactions, and the total revolving balance of the credit card, respectively. As the total number of transactions increases, the SHAP value decreases, indicating a negative relationship with customer churn. This suggests that customers with a higher number of transactions are less likely to churn. For the second most important variable, the higher the total amount of transactions, the higher the SHAP value. When the total transaction amount is small, the SHAP value can be either positive or negative. There is a significant positive relationship with customer churn only when the total amount exceeds a certain threshold, indicating that customers are more likely to churn when their total transaction amount is above this threshold. For "Total_Revolving_Bal", the greater the total revolving balance of the credit card, the lower the SHAP value, indicating that customers with a larger total credit card revolving balance are less prone to churn compared to those with smaller total credit card revolving balance.

    Furthermore, for each optimal prediction model, the variables "Total_Relationship_Count", "Total_Amt_Chng_Q4_Q1", and "Total_Ct_Chng_Q4_Q1" also rank among the top 10 of the SHAP values, consistently appearing in the middle positions. These three variables represent the total number of bank products held by customers, the change in the amount from the fourth quarter to the first quarter, and the change in the number of transactions, respectively. For these three variables, as their values decrease, their SHAP values increase, leading to the predicted result tending closer to 1, which indicates a negative relationship with customer churn. This means that customers who hold a larger number of products or have significant changes in amount and transaction frequency across different quarters are less likely to churn.

    For the remaining variables in the SHAP summary plots, although some are important in most optimal prediction models, they pertain to customers' personal information. Therefore, it is not convenient to discuss and classify the attributes of variables such as "Customer_Age", "Marital_Status_0", "Marital_Status_1", and "Education_Level". Additionally, "Months_Inactive_12_mon" for the total number of months inactive in the past year, "Contacts_Count_12_mon" for the total number of contacts in the past year, and "Gender" are only among the top 10 in the random oversampling-XGBoost model. Among these, "Gender" is personal information, while the other two variables are positively related to customer churn. Furthermore, the variable "Avg_Utilization_Ratio" only appears in the ADASYN-XGBoost model, ranking 10th in importance. This variable represents the average utilization rate of bank credit cards, with a lower rate indicating a higher likelihood of customer churn.

    The statsmodels toolbox in python is used to analyze the causal effects between variables without personal information of customers and customer churn in the SHAP summary plots. The XGBoost model is selected for estimating customer churn in R-learner due to its superior predictive performance. The experimental results of the causal inference method are listed as follows.

    According to Table 3, the p-values for "Total_Revolving_Bal" and "Avg_Utilization_Ratio" are greater than 0.05, indicating no significant difference in customer churn caused by changes in these two treatment variables. Combined with the SHAP values analysis, this suggests that there is only a correlation between each of these two variables and customer churn, not a causal relationship. For the remaining treatment variables, the p-values are all less than 0.05, indicating that changes in each of these variables have causal relationships with customer churn.

    Table 3.  The experimental results of R-learner.
    Variables coef std err t P>|t| 95 % interval
    Total_Trans_Ct 0.0003 2.48e-05 13.109 0.000 [0.000, 0.000]
    Total_Trans_Amt 7.163e-06 4.14e-07 17.308 0.000 [6.35e-06, 7.97e-06]
    Total_Revolving_Bal 4.396e-06 5.8e-06 0.758 0.449 [-1.58e-05, 6.98e-06]
    Total_Relationship_Count 0.0008 0.000 6.324 0.000 [-0.001, -0.001]
    Total_Amt_Chng_Q4_Q1 0.0079 0.001 7.078 0.000 [0.010, 0.006]
    Total_Ct_Chng_Q4_Q1 0.0051 0.001 4.819 0.000 [0.007, 0.003]
    Months_Inactive_12_mon 0.0007 0.000 3.968 0.000 [0.000, 0.001]
    Contacts_Count_12_mon 0.0006 0.000 3.872 0.000 [0.000, 0.001]
    Avg_Utilization_Ratio 0.0039 0.010 0.384 0.701 [0.016, 0.024]

     | Show Table
    DownLoad: CSV

    Considering the coefficient values in Table 3, the treatment variables "Total_Trans_Ct", "Total_Relationship_Count", "Total_Amt_Chng_Q4_Q1", and "Total_Ct_Chng_Q4_Q1" have negative coefficients, indicating negative causal relationships with customer churn. Among these variables, "Total_Amt_Chng_Q4_Q1" has the largest causal effect, while "Total_Trans_Ct" has the smallest. Conversely, the coefficients of "Total_Trans_Amt", "Months_Inactive_12_mon", and "Contacts_Count_12_mon" are positive, indicating positive causal relationships with customer churn. "Months_Inactive_12_mon" displays the largest causal effect, while "Total_Trans_Amt" displays the smallest. Compared with the results of the SHAP values method, the similarity is that these seven variables have the same direction of impact on customer churn, while the difference lies in the order of the quantified effect values.

    In the current research, to enhance the accuracy of prediction results, a combination of sampling techniques and machine learning models was employed to forecast customer churn in banks. A comparative performance analysis indicates that the XGBoost model consistently outperforms other machine learning models, achieving an accuracy of at least 97%, regardless of the sampling techniques used. Furthermore, the SHAP values method was utilized to interpret the optimized prediction models, while R-learner was used to investigate the causal effects of these variables on customer churn. Based on these two methods, the main important variables affecting customer churn, which include the total number and amount of transactions with the bank in the past year, the total number of bank products held by the customer, and the changes in the amount and number of transactions from the fourth quarter to the first quarter, were identified. Additionally, the analysis found that the total credit card revolving balance does not have a significant causal relationship with customer churn, but there is a strong correlation. The research findings provide valuable recommendations for bank managers to improve customer management strategies.

    Due to the limited number of minority class samples in the dataset, the experiment requires sampling techniques to generate synthetic samples. This approach enables the prediction model to more accurately identify the categories of samples. Furthermore, excluding the variables belonging to customers' personal information, the other variables consist of cross-sectional data. They are typically utilized in the analysis that emphasize the differences among individual samples, rather than focusing on changes within a sample over time. Therefore, if additional samples with more extensive variables, such as time series data, are available the feasibility of the model's predictions, so that a better comparison of the interaction between the SHAP values method and the casual inference method in the further research.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this study.

    The authors have received no financial assistance from any source in the preparation of this study.

    All authors declare no conflicts of interest in this study.


    Acknowledgments



    This work was supported in part by a grant from the company CONDAT.

    Conflict of interest



    The author declares no conflicts of interest in this article.

    [1] EU Regulation N° 528/2012 https://eur-lex.europa.eu/legal-content/FR/TXT/?uri=CELEX:32012R0528.
    [2] Silva V, Silva C, Soares P, et al. (2020) Isothiazolinone biocides: chemistry, biological, and toxicity profiles. Molecules 25: 991.
    [3] Catao E, Gallois N, Fay F, et al. (2021) Metal resistance genes enrichment in marine biofilm communities selected by biocide-containing surfaces in temperate and tropical coastal environments. Environ Pollut 268: 115835.
    [4] Parr JA (1990) Industrial biocide formulation—The way forward. Int Biodeter 26: 237-244.
    [5] Morton LHG, Greenway DLA, Gaylarde CC, et al. (1998) Consideration of some implications of the resistance of biofilms to biocides. Int Biodeter Biodegrad 41: 247-259.
    [6] McDonnell G, Russell AD (1999) Antiseptics and disinfectants: activity, action, and resistance. Clin Microbiol Rev 12: 147-179.
    [7] Maillard JY (2002) Bacterial target sites for biocide action. J Appl Microbiol 92: 16S-27S.
    [8] Chapman JS (2003) Biocide resistance mechanisms. Int Biodeter Biodegrad 51: 133-138.
    [9] Morente EO, Fernández-Fuentes MA, Grande Burgos MJ, et al. (2013) Biocide tolerance in bacteria. Int J Food Microbiol 162: 13-25.
    [10] Longtin J, Seah C, Siebert K, et al. (2011) Distribution of antiseptic resistance genes qacA, qacB, and smr in methicillin-resistant Staphylococcus aureus isolated in Toronto, Canada, from 2005 to 2009. Antimicrob Agents CH 55: 2999-3001.
    [11] Tkachenko O, Shepard J, Aris VM, et al. (2007) A triclosan-ciprofloxacin cross-resistant mutant strain of Staphylococcus aureus displays an alteration in the expression of several cell membrane structural and functional genes. Res Microbiol 158: 651-658.
    [12] McMurry LM, Oethinger M, Levy SB (1998) Triclosan targets lipid synthesis. Nature 394: 531-532.
    [13] Miller SM (1999) Bacterial detoxification of Hg (II) and organomercurials. Essays Biochem 34: 17-30.
    [14]  ISO 20776–1: 2019 Susceptibility testing of infectious agents and evaluation of performance of antimicrobial susceptibility test devices—Part 1: Broth micro-dilution reference method for testing the in vitro activity of antimicrobial agents against rapidly growing aerobic bacteria involved in infectious diseases .
    [15] NF EN 1040 (2006) .T72–152 Chemical disinfectants and antiseptics-Quantitative suspension test for the evaluation of basic bactericidal activity of chemical disinfectants and antiseptics - Test method and requirements (phase 1).
    [16] NF EN 1276 (2019) .T72–173 Chemical disinfectants and antiseptics-Quantitative suspension test for the evaluation of bactericidal activity of chemical disinfectants and antiseptics used in food, industrial, domestic and institutional areas - Test method and requirements (phase 2, step 1).
    [17] Parvekar P, Palaskar J, Metgud S, et al. (2020) The minimum inhibitory concentration (MIC) and minimum bactericidal concentration (MBC) of silver nanoparticles against Staphylococcus aureusBiomater Investig Dent 7: 105-109.
    [18] NF T72-190 (1988) .T72–190 Water-miscible contact disinfectants used in liquid state. Germ carrier method. Determination of the bactericidal, fungicidal and sporicidal activity.
    [19] Parker AE, Walker DK, Goeres DM, et al. (2014) Ruggedness and reproducibility of the MBEC biofilm disinfectant efficacy test. J Microbiol Methods 102: 55-64.
    [20] Lefebvre E, Vighetto C, Di Martino P, et al. (2016) Synergistic antibiofilm efficacy of various commercial antiseptics, enzymes and EDTA: a study of Pseudomonas aeruginosa and Staphylococcus aureus biofilms. Int J Antimicrob Agents 48: 181-188.
    [21] Gilbert P, Das JR, Jones MV, et al. (2001) Assessment of resistance towards biocides following the attachment of micro-organisms to, and growth on, surfaces. J Appl Microbiol 91: 248-254.
    [22] European Chemicals Agency (2018) .Guidance on the BPR: Volume II Efficacy-Assessment and Evaluation (Parts B + C).
    [23] Helmi K, David F, Di Martino P, et al. (2018) Assessment of flow cytometry for microbial water quality monitoring in cooling tower water and oxidizing biocide treatment efficiency. J Microbiol Methods 152: 201-209.
    [24] Nicol M, Ben Mlouka MA, Berthe T, et al. (2019) Anti-persister activity of squalamine against Acinetobacter baumanniiInt J Antimicrob Agents 53: 337-342.
    [25] Ayrapetyan M, Williams TC, Baxter R, et al. (2015) Viable but nonculturable and persister cells coexist stochastically and are induced by human serum. Infect Immun 83: 4194-4203.
    [26] Brinksmeier E, Meyer D, Huesmann-Cordes AG, et al. (2015) Metalworking fluids—Mechanisms and performance. CIRP Annals 64: 605-628.
    [27] Cook PE, Gaylarde CC (1988) Biofilm formation in aqueous metal working fluids. Int Biodeter 24: 265-270.
    [28] Ortiz C, Guiamet PS, Videla HA (1990) Relationship between biofilms and corrosion of steel by microbial contaminants of cutting-oil emulsions. Int Biodeter 26: 315-326.
    [29] Morton LHG, Greenway DLA, Gaylarde CC, et al. (1998) Consideration of some implications of the resistance of biofilms to biocides. Int Biodeter Biodegrad 41: 247-259.
    [30] Falkinham JO (2009) Effects of biocides and other metal removal fluid constituents on Mycobacterium immunogenumAppl Environ Microbiol 75: 2057-2061.
    [31] Shennan JL (1983) Selection and evaluation of biocides for aqueous metal-working fluids. Tribol Int 16: 317-330.
    [32] Gilbert Y, Veillette M, Duchaine C (2010) Metalworking fluids biodiversity characterization. J Appl Microbiol 108: 437-449.
    [33] Rabenstein A, Koch T, Remesch M, et al. (2009) Microbial degradation of water miscible metal working fluids. Int Biodeter Biodegrad 63: 1023-1029.
    [34] Sandin M, Mattsby-Baltzer I, Edebo L (1991) Control of microbial growth in water-based metal-working fluids. Int Biodeter 27: 61-74.
    [35] van der Gast CJ, Knowles CJ, Wright MA, et al. (2001) Identification and characterisation of bacterial populations of an in-use metal-working fluid by phenotypic and genotypic methodology. Int Biodeter Biodegrad 47: 113-123.
    [36] Moore JS, Christensen M, Wilson MRW, et al. (2000) Mycobacterial contamination of metal working fluids: involvement of a possible new taxon of rapidly growing mycobacteria. Am Ind Hyg Assoc J 61: 205-213.
    [37] Kapoor R, Yadav JS (2012) Expanding the mycobacterial diversity of metalworking fluids (MWFs): evidence showing MWF colonization by Mycobacterium abscessusFEMS Microbiol Ecol 79: 392-399.
    [38] Wilson RW, Steingrube VA, Bottger EC, et al. (2001) Mycobacterium immunogenum sp. nov., a novel species related to Mycobacterium abscessus and associated with clinical disease, pseudo-outbreaks and contaminated metalworking fluids: an international cooperative study on mycobacterial taxonomy. Int J Syst Evol Microbiol 51: 1751-1764.
    [39] Burton CM, Crook B, Scaife H, et al. (2012) Systematic review of respiratory outbreaks associated with exposure to water-based metalworking fluids. Ann Occup Hyg 56: 374-388.
    [40] Trafny EA (2013) Microorganisms in metalworking fluids: current issues in research and management. Int J Occup Med Environ Health 26: 4-15.
    [41] Moscoso F, Deive FJ, Villar P, et al. (2012) Assessment of a process to degrade metal working fluids using Pseudomonas stutzeri CECT 930 and indigenous microbial consortia. Chemosphere 86: 420-426.
    [42] Molin G, Nilsson I (1985) Degradation of phenol by Pseudomonas putida ATCC11172 in continuous culture at different ratios of biofilm surface to culture volume. Appl Environ Microbiol 50: 946-950.
    [43] Lundov MD, Johansen JD, Zachariae C, et al. (2011) Low-level efficacy of cosmetic preservatives. Int J Cosmet Sci 33: 190-196.
    [44] Sondossi M, Riha VF, Rossmoore HW, et al. (1993) Factors involved in bactericidal activities of formaldehyde and formaldehyde and formaldehyde condensate/isothiazolone mixtures. Int Biodeter Biodegrad 32: 243-261.
    [45] Lambert RJ, Johnston MD, Hanlon GW, et al. (2003) Theory of antimicrobial combinations: biocide mixtures—synergy or addition? J Appl Microbiol 94: 747-759.
    [46] Bhattacharya A (2017) Biocides for metal working fluids: India outlook. Lube Magazine 138: 37-40.
    [47] Alakomi HL, Paananen A, Suihko ML, et al. (2006) Weakening effect of cell permeabilizers on Gram-negative bacteria causing biodeterioration. Appl Environ Microbiol 72: 4695-4703.
    [48] Zhao K, Wen J, Gu T, et al. (2005) Effects of biocides and a biocide enhancer on SRB growth. Annual Meeting Conference Proceedings New York: AIChE.
    [49] Wen J, Gu T (2007) Evaluations of a green biocide and a green biocide enhancer for the mitigation of biocorrosion using an electrochemical bioreactor. Annual Meeting Conference Proceedings New York: AIChE.
    [50] Wen J, Zhao K, Gu T, et al. (2009) A green biocide enhancer for the treatment of sulfate-reducing bacteria (SRB) biofilms on carbon steel surfaces using glutaraldehyde. Int Biodeter Biodegrad 63: 1102-1106.
    [51] Wen J, Xu D, Gu T, et al. (2012) A green triple biocide cocktail consisting of a biocide, EDDS and methanol for the mitigation of planktonic and sessile sulfate-reducing bacteria. World J Microbiol Biotechnol 28: 431-435.
    [52] Fox SJ, Fazil MH, Dhand C, et al. (2016) Insight into membrane selectivity of linear and branched polyethylenimines and their potential as biocides for advanced wound dressings. Acta Biomater 37: 155-164.
    [53] Di Maiuta N, Schwarzentruber P, Dow CS (2011) Enhancement of the antimicrobial performance of biocidal formulations used for the preservation of white mineral dispersions. Appl Microbiol Biotechnol 89: 429-439.
    [54] McKnight RF, Adida M, Budge K, et al. (2012) Lithium toxicity profile: a systematic review and meta-analysis. Lancet 379: 72-728.
    [55] Sondossi M, Riha VF, Rossmoore HW (1990) The potentiation of industrial biocide activity with Cu2+. I: synergistic effect of Cu2+ with formaldehyde. Int Biodeterior 26: 51-61.
    [56] Riha VF, Sondossi M, Rossmoore HW (1990) The potentiation of industrial biocide activity with Cu2+. II. Synergistic effects with 5-chloro-2-methyl-4-isothiazolin-3-one. Int Biodeter 26: 303-313.
    [57] Hordyjewska A, Popiołek Ł, Kocot J (2014) The many ‘faces’ of copper in medicine and treatment. Biometals 27: 611-621.
    [58] Taylor TM, Joerger R, Palou E, et al. (2012) Alternatives to traditional antimicrobials for organically processed meat and poultry. Organic meat production and processing Iowa State University Press, 211-237.
    [59] Mani-López E, García HS, López-Malo A (2012) Organic acids as antimicrobials to control Salmonella in meat and poultry products. Food Res Int 45: 713-721.
    [60] Stratford M, Plumridge A, Nebe-von-Caron G, et al. (2009) Inhibition of spoilage mould conidia by acetic acid and sorbic acid involves different modes of action, requiring modification of the classical weak-acid theory. Int J Food Microbiol 136: 37-43.
    [61] Cabezas-Pizarro J, Redondo-Solano M, Umaña-Gamboa C, et al. (2017) Antimicrobial activity of different sodium and potassium salts of carboxylic acid against some common foodborne pathogens and spoilage-associated bacteria. Rev Argent Microbiol 50: 56-61.
    [62] Surekha M, Reddy SM (2000) Preservatives, classification and properties. Encyclopedia of food microbiology New York: Academic Press, 1710-1717.
    [63] Zhou F, Ji B, Zhang H, et al. (2007) Synergistic effect of thymol and carvacrol combined with chelators and organic acids against Salmonella TyphimuriumJ Food Prot 70: 1704-1709.
    [64] Miller AJ, Call JE, Whiting RC (1993) Comparison of organic acid salts for Clostridium botulinum control in an uncured turkey product. J Food Prot 56: 958-962.
    [65] Arias-Moliz MT, Ferrer-Luque CM, Espigares-Rodríguez E, et al. (2008) Bactericidal activity of phosphoric acid, citric acid, and EDTA solutions against Enterococcus faecalisOral Surg Oral Med Oral Pathol Oral Radiol Endod 106: e84-e89.
    [66] Sofos N, Pierson MD, Blocher JC, et al. (1986) Mode of action of sorbic acid on bacterial cells and spores. Int J Food Microbiol 3: 1-17.
    [67] Garza S, Canela R, Vinas I, et al. (1993) Effects of potassium sorbate on growth and penicillic acid production by Aspergillus ochraceus and Penicillium aurantiogriseumZentralbl Mikrobiol 148: 343-350.
    [68] Finol ML, Marth EH, Lindsay RC (1982) Depletion of sorbate from different media during growth of Penicillium species. J Food Prot 45: 398-404.
    [69] Bae YM, Lee SY (2017) Effect of salt addition on acid resistance response of Escherichia coli O157: H7 against acetic acid. Food Microbiol 65: 74-82.
    [70] Würgler FE, Schlatter J, Maier P (1992) The genotoxicity status of sorbic acid, potassium sorbate and sodium sorbate. Mutat Res 283: 107-111.
    [71] Mohammadzadeh-Aghdash H, Sohrabi Y, Mohammadi A, et al. (2018) Safety assessment of sodium acetate, sodium diacetate and potassium sorbate food additives. Food Chem 257: 211-215.
    [72] Zhang J, Zhang C, Zhu Y, et al. (2018) Biodegradation of seven phthalate esters by Bacillus mojavensis B1811. Int Biodet Biodegrad 132: 200-207.
    [73] Hayakawa C, Fujii K, Funakawa S, et al. (2018) Effects of sorption on biodegradation of low-molecular-weight organic acids in highly-weathered tropical soils. Geoderma 324: 109-118.
    [74] Papageorgiou S, Varvaresou A, Tsirivas E, et al. (2010) New alternatives to cosmetics preservation. J Cosmet Sci 61: 107-123.
    [75] Kočevar Glavač N, Lunder M (2018) Preservative efficacy of selected antimicrobials of natural origin in a cosmetic emulsion. Int J Cosmet Sci 40: 276-284.
    [76] Chen D, Zhao T, Doyle MP (2015) Single- and mixed-species biofilm formation by Escherichia coli O157:H7 and Salmonella, and their sensitivity to levulinic acid plus sodium dodecyl sulfate. Food Control 57: 48-53.
    [77] Elramady MG, Aly SS, Rossitto PV, et al. (2013) Synergistic effects of lactic acid and sodium dodecyl sulfate to decontaminate Escherichia coli O157:H7 on cattle hide sections. Foodborne Pathog Dis 10: 661-663.
    [78] Winter M, Bock R, Herrmann C, et al. (2012) Technological evaluation of a novel glycerol-based biocide-free metalworking fluid. J Clean Prod 35: 176-182.
    [79] Wichmann H, Stache H, Schmidt C, et al. (2013) Ecological and economic evaluation of a novel glycerol-based biocide-free metalworking fluid. J Clean Prod 43: 12-19.
    [80] Gelinski S, Winter M, Wichmann H, et al. (2016) Development and testing of a novel glycerol/chitosan-based biocide-free hydraulic fluid. J Clean Prod 112: 3589-3596.
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(6337) PDF downloads(476) Cited by(15)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog