The dynamic behavior of the keyhole and molten pool is associated with the quality of weld seam. In this study, an on-line visual monitoring system is devised to photograph the keyhole and molten pool during external magnetic field assisted laser welding on the AISI 2205 duplex stainless steel plates. Seven features are defined to describe the morphology of the keyhole and molten pool. Then, the principal component analysis (PCA) algorithm is applied to reduce the dimensions of these features to obtain different number of principal components (PCs). Three different machine learning algorithms, i.e. the back propagation neural network (BPNN), the radial based function neural network (RBFNN) and the support vector regression (SVR), are utilized to fit the relationship between the chosen PCs and the weld width. Finally, the global and local prediction accuracy of these three machine learning algorithms are compared under different number of PCs. Results illustrated that data dimensionality reduction is helpful to improve the modeling efficiency. Machine learning algorithms can be exploited to predict the weld quality during laser welding with high accuracy. Among them, the BPNN model performs best and SVR model performs better than RBFNN model in this research. This work aims to model the relation between the features in weld zone and the weld quality with different machine learning algorithms, and provides a guideline of model selection for laser welding on-line monitoring and a necessary foundation for realizing intelligent welding with advanced algorithm.
Citation: Wang Cai, Jianzhuang Wang, Longchao Cao, Gaoyang Mi, Leshi Shu, Qi Zhou, Ping Jiang. Predicting the weld width from high-speed successive images of the weld zone using different machine learning algorithms during laser welding[J]. Mathematical Biosciences and Engineering, 2019, 16(5): 5595-5612. doi: 10.3934/mbe.2019278
Related Papers:
[1]
Jun Gao, Qian Jiang, Bo Zhou, Daozheng Chen .
Convolutional neural networks for computer-aided detection or diagnosis in medical image analysis: An overview. Mathematical Biosciences and Engineering, 2019, 16(6): 6536-6561.
doi: 10.3934/mbe.2019326
[2]
Haiyan Song, Cuihong Liu, Shengnan Li, Peixiao Zhang .
TS-GCN: A novel tumor segmentation method integrating transformer and GCN. Mathematical Biosciences and Engineering, 2023, 20(10): 18173-18190.
doi: 10.3934/mbe.2023807
[3]
Tongping Shen, Fangliang Huang, Xusong Zhang .
CT medical image segmentation algorithm based on deep learning technology. Mathematical Biosciences and Engineering, 2023, 20(6): 10954-10976.
doi: 10.3934/mbe.2023485
[4]
Jinyun Jiang, Jianchen Cai, Qile Zhang, Kun Lan, Xiaoliang Jiang, Jun Wu .
Group theoretic particle swarm optimization for gray-level medical image enhancement. Mathematical Biosciences and Engineering, 2023, 20(6): 10479-10494.
doi: 10.3934/mbe.2023462
[5]
Tong Shan, Jiayong Yan, Xiaoyao Cui, Lijian Xie .
DSCA-Net: A depthwise separable convolutional neural network with attention mechanism for medical image segmentation. Mathematical Biosciences and Engineering, 2023, 20(1): 365-382.
doi: 10.3934/mbe.2023017
[6]
Mei-Ling Huang, Zong-Bin Huang .
An ensemble-acute lymphoblastic leukemia model for acute lymphoblastic leukemia image classification. Mathematical Biosciences and Engineering, 2024, 21(2): 1959-1978.
doi: 10.3934/mbe.2024087
Tingxi Wen, Hanxiao Wu, Yu Du, Chuanbo Huang .
Faster R-CNN with improved anchor box for cell recognition. Mathematical Biosciences and Engineering, 2020, 17(6): 7772-7786.
doi: 10.3934/mbe.2020395
[9]
Vasileios E. Papageorgiou, Georgios Petmezas, Pantelis Dogoulis, Maxime Cordy, Nicos Maglaveras .
Uncertainty CNNs: A path to enhanced medical image classification performance. Mathematical Biosciences and Engineering, 2025, 22(3): 528-553.
doi: 10.3934/mbe.2025020
[10]
Shen Jiang, Jinjiang Li, Zhen Hua .
Transformer with progressive sampling for medical cellular image segmentation. Mathematical Biosciences and Engineering, 2022, 19(12): 12104-12126.
doi: 10.3934/mbe.2022563
Abstract
The dynamic behavior of the keyhole and molten pool is associated with the quality of weld seam. In this study, an on-line visual monitoring system is devised to photograph the keyhole and molten pool during external magnetic field assisted laser welding on the AISI 2205 duplex stainless steel plates. Seven features are defined to describe the morphology of the keyhole and molten pool. Then, the principal component analysis (PCA) algorithm is applied to reduce the dimensions of these features to obtain different number of principal components (PCs). Three different machine learning algorithms, i.e. the back propagation neural network (BPNN), the radial based function neural network (RBFNN) and the support vector regression (SVR), are utilized to fit the relationship between the chosen PCs and the weld width. Finally, the global and local prediction accuracy of these three machine learning algorithms are compared under different number of PCs. Results illustrated that data dimensionality reduction is helpful to improve the modeling efficiency. Machine learning algorithms can be exploited to predict the weld quality during laser welding with high accuracy. Among them, the BPNN model performs best and SVR model performs better than RBFNN model in this research. This work aims to model the relation between the features in weld zone and the weld quality with different machine learning algorithms, and provides a guideline of model selection for laser welding on-line monitoring and a necessary foundation for realizing intelligent welding with advanced algorithm.
1.
Introduction
Breast cancer is one of the most common malignant tumors in women. In 2013, there were about 180 million new cases of female breast cancer in the world [1,2]. In China, the incidence rate of breast cancer in 2013 was 42.02/10 million, ranking the first place in the incidence of cancer in women, and the mortality rate was 9.74/10 million, ranking fifth in the female cancer death cause [3,4,5]. Breast cancer seriously endangers women's life and health, and brings huge economic burden to the family and society [6].
At present, the traditional statistical methods to analyze the medical expenses and influencing factors of breast cancer include multiple linear regression, logistic regression analysis and so on [7,8,9,10]. Back propagation (BP) neural network is an artificial neural network model based on error back propagation [11]. Compared with traditional statistical methods, it has no special requirements for the type and distribution of data, and has some fault tolerance, so it has more advantages [12,13,14,15]. In recent years, BP neural network model and other data mining methods have also been applied to analyze the medical expenses of cancer patients such as gastric cancer, lung cancer, liver cancer, gynecological cancer and so on [16,17,18].
In this study, the medical expenses of female breast cancer patients in rural areas of Anhui Province were selected as the research object. After the BP neural network model was constructed, the results were compared with the multiple linear model. The aim was to verify the effectiveness and predictive power of the two models for the influencing factors of medical expenses of breast cancer patients, so as to provide scientific theoretical basis for reasonable control of medical expenses of breast cancer.
2.
Materials and methods
2.1. General data
According to the data of cancer registration report in Anhui Province, the medical records of all female breast cancer patients diagnosed from 2017 to 2019 and coded as C50 according to ICD-10 disease code were collected from four county people's hospitals (Feidong, Feixi, Changfeng, and Lujiang) in Anhui Province, including hospitalization number, medical insurance number, medical payment method, operation, chemotherapy and radiotherapy; Medical insurance number, name, date of birth, disease diagnosis, total cost of hospitalization, hospital and other information were collected from social security department. A total of 846 samples were collected. All research variables were assigned values (Table 1). After the samples with missing values and illogicality were excluded, 795 cases were included in the study, and the effective rate was 93.97%.
BP neural network is the abbreviation of error back propagation neural network. It is a multilayer feedforward network trained by error back propagation algorithm. It is one of the most widely used neural network models at present. It is composed of an input layer, one or more hidden layers and an output layer. Each layer is composed of a certain number of neurons. Its structure is shown in Figure 1. First, the number of neurons in each layer is determined. The number of neurons in the input layer and the output layer is determined by the independent and dependent variables of the actual research. There is no unified calculation method for the number of neurons in the hidden layer. In this study, the formula (M represents the number of neurons in the hidden layer, n represents the number of neurons in the input layer, m represents the number of neurons in the output layer, and represents any value from 1 to 10) is used to calculate the number of neurons in the hidden layer. Secondly, network training. Through SSPS Clementine 12.0 software to simulate the training set, different BP neural network models are obtained. Finally, the network test is carried out. The test set is substituted into each model, and the model with the highest accuracy is the optimal model.
Figure 1.
Structure of back propagation neural network model.
Multiple linear regression is a regression analysis based on the given values of multiple explanatory variables. It is a method to study the linear relationship between a dependent variable and multiple independent variables [19]. The general form of multiple linear regression model is:
Y=β0+β1x1+β2x2+β3x3+I+βixi+e
(1)
Where β0 is a constant term, I is the number of independent variables, βi (i = 1, 2… i) is partial regression coefficient, e is random error. The meaning of partial regression coefficient is the average change of dependent variable Y when the independent variable changes one unit while other independent variables remain unchanged. After the multiple linear regression parameters are obtained, it is necessary to carry out statistical tests to determine the reliability of the model performance, including the fitting test (coefficient of determination), the significance test of the overall linear equation (F test), the significance test of variables (t test), etc. [20].
2.4. Model evaluation method
The performance of the two models is evaluated by the following indicators, where R2 is the coefficient of determination, which can explain the percentage of the independent variable explaining the change of the dependent variable. The value range is between 0 and 1. The closer the value is to 1, the better the fitting degree of the model to the sample is [21]. The absolute mean error (MAE) reflects the actual error of the model, and the root mean square error (RMSE) is the arithmetic square root of the mean square error (MSE). The calculation formula of each index is as follows.
R2=1−∑ni=1(yi−^yl)2∑ni=1(yi−¯yl)2
(2)
MAE=1n∑ni=1|∧yl−yi|
(3)
MSE=1n∑ni=1wi(yi−∧yi)2
(4)
RMSE=√1n∑ni=1wi(yi−∧yi)2
(5)
The predicted data of all patients were test by the normality test. The data both in back propagation neural network model and multiple linear regression model were consistent with normal distribution. After passing the homogeneity test, the difference between the two models were compared by t test.
3.
Result
3.1. Basic information of patients
The median medical expenses of breast cancer patients were (24576 ± 4792) RMB, and the hospitalization days was (31.4 ± 6.7) d. The composition ratio of other variables is shown in Table 2.
Table 2.
Basic information of hospitalized patients with breast cancer.
The SPSS Clementine 12.0 data mining platform was used to construct BP neural network model and multiple linear regression model for medical expenses of female breast cancer patients. BP neural network model takes nine indicators such as region, year of diagnosis and medical payment mode as input and logarithm of medical expenses as output. It adopts random sampling method and takes 70% samples as training set and 30% samples as test set. After repeated verification, a three-layer BP neural network model is finally constructed, with 9 neurons in the input layer, 10 neurons in the hidden layer and 1 neuron in the output layer, and the accuracy is 95.97%. The model results are shown in Table 3.
Table 3.
Summary of Back propagation neural network model results.
In the multiple linear regression model, 9 indicators such as region, year of diagnosis and medical payment mode were taken as independent variables, and the logarithm of medical expenses was taken as dependent variable. The multiple correlation coefficient was 0.841, which indicated that the model fitted well. The probability of F statistic of the model population linear test (P < 0.001) indicates that there is a significant linear relationship between independent variables and dependent variables. The model results are shown in Table 4.
Table 4.
Summary of multiple linear regression models.
3.3. Comparative analysis of influencing factors of medical expenses of breast cancer patients
BP neural network model gives the sensitivity of each variable, that is, the influence of each variable change on medical expenses. The analysis and comparison results of the two models are shown in Table 5. It can be seen that the top two influencing factors for medical expenses of breast cancer patients are length of stay and region; whether radiotherapy, surgery, age and chemotherapy also have a greater impact on medical expenses; medical payment method, diagnosis year and clinical stage have a smaller impact on medical expenses.
Table 5.
Comparative analysis of influencing factors of medical expenses in patients with breast cancer.
3.4. Comparison between back propagation neural network model and multiple linear regression model
All samples were substituted into the established BP neural network model and multiple linear regression model to evaluate the performance of the two models. The results are shown in Table 6. The coefficient of determination (R2) of BP neural network model was larger than that of multiple linear regression model, so the fitting degree of BP neural network model was better than that of multiple linear regression model. The MAE, MSE and RMSE values of BP neural network model were less than those of multiple linear regression model, so the prediction ability of BP neural network model was better than that of multiple linear regression model.
Table 6.
Comparison between back propagation neural network model and multiple linear regression model.
In this study, the median medical expenses and length of stay of breast cancer patients were (24576 ± 4792) RBM and (31.4 ± 6.7) d. The high cost of hospitalization not only affects the delay of treatment, but also hinders the choice of treatment [22,23]. At the same time, breast cancer brings huge disease burden to patients. Therefore, it is suggested to strengthen cancer screening for women, so as to achieve early detection, early diagnosis and early treatment, so as to reduce the economic burden and disease burden for individuals, families and society.
The results show that BP neural network model and multiple linear regression model can fit the data well. Through the comparative analysis of the influencing factors of medical expenses of breast cancer patients, it can be seen that the hospitalization days and regions are consistent in the two models, whether surgery, age of diagnosis, chemotherapy, medical payment method, year of diagnosis and clinical stage are basically consistent in the two models, but whether radiotherapy is inconsistent in the two models. The length of stay has the greatest impact on the cost of hospitalization, which is consistent with the existing research. Therefore, on the premise of ensuring the level of medical services, shortening the length of stay is an effective measure to reduce the medical expenses of breast cancer patients. There are also great differences in the medical expenses of breast cancer among regions, mainly due to the different economic development and medical technology levels. It is suggested that medical service providers can standardize the clinical pathway and provide efficient and affordable treatment for patients. Medical insurance is the main means to reduce the economic burden of rural patients. In this study, it has little impact on medical expenses. It is suggested that medical insurance policy makers should strengthen the compensation for rural cancer patients and reduce their economic risks.
In recent years, BP neural network model has been widely used in the field of medicine, and achieved good results [24,25]. Although BP neural network model has some disadvantages, such as over training, slow convergence speed, easy to fall into local optimum, it has no requirements for data type and distribution, has certain fault tolerance, and can correct errors repeatedly in the process of self-learning [26,27]. These advantages have great advantages in dealing with medical data with the characteristics of complexity and diversity. The results show that the determination coefficient of BP neural network model is greater than that of multiple linear regression model, and the values of MAE, RMSE and MSE are less than the corresponding values of multiple linear regression model, so its prediction ability is higher than that of multiple linear regression model.
5.
Conclusions
Compared with multiple linear regression model, BP neural network model is more suitable for the analysis of medical expenses in patients with breast cancer. However, the model itself has no advantages or disadvantages, only the applicable conditions of each model are different.
Acknowledgements
The authors would like to acknowledge the all the breast cancer patients for their participation and support.
Conflict of interest
The authors declared that there was no conflict of interests.
References
[1]
E. Assunção, L. Quintino and R. Miranda, Comparative study of laser welding in tailor blanks for the automotive industry, Int. J. Adv. Manuf. Tech., 49 (2010), 123–131.
[2]
X. Cao, M. Jahazi, J. Immarigeon, et al., A review of laser welding techniques for magnesium alloys,J. Mater. Process. Tech., 171 (2006), 188–204.
[3]
R. T. Yang and Z. W. Chen, A study on fiber laser lap welding of thin stainless steel, Int. J. Precis. Eng. Man., 14 (2013), 207–214.
[4]
J. Stavridis, A. Papacharalampopoulos and P. Stavropoulos, Quality assessment in laser welding: a critical review, Int. J. Adv. Manuf. Tech., 94 (2017), 1825–1847.
[5]
M. Jager and F. A. Hamprecht, Principal component imagery for the quality monitoring of dynamic laser welding processes, IEEE T. Ind. Electron., 56 (2008), 1307–1313.
[6]
L. E. Afanas'eva, I. A. Barabonova, P. O. Zorenko, et al., Laser welding in external electrical and magnetic fields, Weld. Int., 27 (2013), 545–547.
[7]
A. Schneider, V. Avilov, A. Gumenyuk, et al., Laser beam welding of aluminum alloys under the influence of an electromagnetic field, Phys. Procedia, 41 (2013), 4–11.
[8]
J. Volpp, Keyhole stability during laser welding-Part II: process pores and spatters, Prod. Eng. Res. Devel., 11 (2016), 9–18.
[9]
J. Stavridis, A. Papacharalampopoulos and P. Stavropoulos, A cognitive approach for quality assessment in laser welding, Procedia CIRP, 72 (2018), 1542–1547.
[10]
C. H. Kim and D. C. Ahn, Coaxial monitoring of keyhole during Yb:YAG laser welding, Opt. Laser Technol., 44 (2012), 1874–1880.
[11]
A. Matsunawa, J. D. Kim, N. Seto, et al., Dynamics of keyhole and molten pool in laser welding, J. Laser Appl., 10 (1998), 247–254.
[12]
G. Chen, M. Zhang, Z. Zhao, et al., Measurements of laser-induced plasma temperature field in deep penetration laser welding, Opt. Laser Technol., 45 (2013), 551–557.
[13]
J. Wang, C. Wang, X. Meng, et al., Study on the periodic oscillation of plasma/vapour induced during high power fibre laser penetration welding, Opt. Laser Technol., 44 (2012), 67–70.
[14]
C. Fan, F. Lv and S. Chen, Visual sensing and penetration control in aluminum alloy pulsed GTA welding, Int. J. Adv. Manuf. Tech., 42 (2009), 126–137.
[15]
R. Fabbro, S. Slimani, F. Coste, et al., Study of keyhole behaviour for full penetration Nd–Yag CW laser welding, J. Phys. D. Appl. Phys., 38 (2005), 1881.
[16]
K. Stefan, Process monitoring and control of laser beam welding: Measuring quantifiable data for improved processing results, Laser Technik J., 5 (2008), 41–43.
[17]
K. Stefan, Understanding the Laser Process new approaches for Process monitoring in laser materials Processing, Laser Technik J., 7, (2010), 49–52.
[18]
Z. Al-Sarraf and M. Lucas, A study of weld quality in ultrasonic spot welding of similar and dissimilar metals, J. Phys. Conference Series, 382 (2012), 012013.
[19]
L. Yang and I. C. Ume, Inspection of simulated weld penetration depth using laser-generated Lamb waves and wavelet signal processing, AIP Conference Proceedings. AIP, 1650 (2015), 1386–1391.
[20]
M. A. Maher, P. J. L. Webster, J. C. Chiao, et al., Coaxial real-time metrology and gas assisted laser micromachining: process development, stochastic behavior, and feedback control, Micromachining and Microfabrication Process Technology XV. International Society for Optics and Photonics, 7590 (2010), 759003.
[21]
P. J. L. Webster, G. L. Wright, Y. Ji, et al., Automatic laser welding and milling with in situ inline coherent imaging, Opt. Lett., 39 (2014), 6217–6220.
[22]
A. Khan, B. Baharudin, L. H. Lee, et al., A review of machine learning algorithms for text-documents classification, J. Adv. Inform. Tech., 1 (2010), 4–20.
[23]
P. Aivaliotis, A. Zampetis, G. Michalos, et al., A machine learning approach for visual recognition of complex parts in robotic manipulation, Procedia Manuf., 11 (2017), 423–430.
[24]
K. Kokkalis, G. Michalos, P. Aivaliotis, et al., An approach for implementing power and force limiting in sensorless industrial robots, Procedia CIRP, 76 (2018), 138–143.
[25]
X. Gao, C. Lan, D. You, et al., Weldment Nondestructive Testing Using Magneto-optical Imaging Induced by Alternating Magnetic Field,J. Nondestruct. Eval., 36 (2017), 55.
[26]
T. K. Xia, P. M. Hui and D. Stroud, Theory of faraday rotation in granular magnetic materials, J. Appl. Phys., 67 (1990), 2736–2741.
[27]
X. Wan, Y. Wang, D. Zhao, et al., Weld quality monitoring research in small scale resistance spot welding by dynamic resistance and neural network, Measurement, 99 (2017), 120–127.
[28]
M. Luo and Y. C. Shin, Estimation of keyhole geometry and prediction of welding defects during laser welding based on a vision system and a radial basis function neural network, Int. J. Adv. Manuf. Tech., 81 (2015), 263–276.
[29]
T. Wang, J. Chen, X. Gao, et al., Quality Monitoring for Laser Welding Based on High-Speed Photography and Support Vector Machine, Appl. Sci., 7 (2017), 299.
[30]
T. Wang, J. Chen, X. Gao, et al., Real-time monitoring for disk laser welding based on feature selection and SVM, Appl. Sci., 7, (2017), 884.
[31]
Z. Zhang, E. Kannatey-Asibu, S. Chen, et al., Online defect detection of Al alloy in arc welding based on feature extraction of arc spectroscopy signal, Int. J. Adv. Manuf. Tech., 79 (2015), 2067–2077.
[32]
J. Michalska and M. Sozańska, Qualitative and quantitative analysis of σ and χ phases in 2205 duplex stainless steel, Mater. Charact., 56 (2006), 355–362.
[33]
S. M. Holland, Principal components analysis (PCA), Department of Geology, University of Georgia, Athens,GA (2008), 30602–2501.
[34]
M. Buscema, Back propagation neural networks, Int. J. Addictions, 33 (1998), 233–270.
[35]
S. Chen, C. F. Cowan and P. M. Grant, Orthogonal least squares learning algorithm for radial basis function networks, IEEE T. Neural Networ., 2 (1991), 302–309.
[36]
H. Drucker, C. J. Burges, L. Kaufman, et al., Support vector regression machines, Adv. Neural Inform. Process. Sys., (1997), 155–161.
[37]
S. Altarazi, L. Hijazi and E. Kaiser, Process parameters optimization for multiple-inputs- multiple-outputs pulsed green laser welding via response surface methodology, 2016 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM). IEEE, (2016), 1041–1045.
[38]
C. J. Willmott, Some comments on the evaluation of model performance, B. Am. Meteorol. Soc., 63 (1982), 1309–1313.
[39]
J. Qian, J. Yi, Y. Cheng, et al., A sequential constraints updating approach for Kriging surrogate model-assisted engineering optimization design problem, Eng. Comput-Germany., (2019), 1–17.
[40]
Z. H. Han, C. Z. Xu, Z. Liang, et al., Efficient aerodynamic shape optimization using variable-fidelity surrogate models and multilevel computational grids, Chinese J. Aeronaut., (2019).
This article has been cited by:
1.
Li Wu, Zhijie Lv, Linjing Lai, Penglei Zhou,
Assessment of influencing factors of hospitalization expenses for Crohn's disease patients: Based on LASSO and linear mixed model,
2022,
10,
2296-2565,
10.3389/fpubh.2022.925616
Wei Guo, Guoyun Gao, Jun Dai, Qiming Sun, Kelvin Wong,
Prediction of Lung Infection during Palliative Chemotherapy of Lung Cancer Based on Artificial Neural Network,
2022,
2022,
1748-6718,
1,
10.1155/2022/4312117
4.
Yihao Zhu, Xiaojing Yang, Yao Zu,
Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure,
2022,
10,
2296-634X,
10.3389/fcell.2022.1089915
5.
Yuanhong Che, Zhangdong Wei, Jing Wen, Hangjun Che,
Construction of Surface Water Pollution Prediction Model Based on Machine Learning,
2022,
2022,
1875-919X,
1,
10.1155/2022/7211094
6.
Hong Jiang, Sinuo Ren, Shengbo Zhang, Xudan Luo, Rui He, ShuaiFei Wang, JianDong Yan, Shan Zhou, Chengliang Yin, Ying Xiao, Zhihuan Li,
Analyzing factors influencing hospitalization costs for five common cancers in China using neural network models,
2025,
1369-6998,
1,
10.1080/13696998.2025.2494459
Wang Cai, Jianzhuang Wang, Longchao Cao, Gaoyang Mi, Leshi Shu, Qi Zhou, Ping Jiang. Predicting the weld width from high-speed successive images of the weld zone using different machine learning algorithms during laser welding[J]. Mathematical Biosciences and Engineering, 2019, 16(5): 5595-5612. doi: 10.3934/mbe.2019278
Wang Cai, Jianzhuang Wang, Longchao Cao, Gaoyang Mi, Leshi Shu, Qi Zhou, Ping Jiang. Predicting the weld width from high-speed successive images of the weld zone using different machine learning algorithms during laser welding[J]. Mathematical Biosciences and Engineering, 2019, 16(5): 5595-5612. doi: 10.3934/mbe.2019278