
Medical image segmentation has important application value in modern medicine: it helps doctors accurately locate and analyze tissue structures, lesion areas, and organ boundaries in images, providing key information for clinical diagnosis and treatment. However, segmentation accuracy still faces many problems, so in this paper we propose a medical image segmentation network combining the Hadamard product and a dual-scale attention gate (DAU-Net). First, the Hadamard product is introduced into the fifth layer of the encoder-decoder structure for element-wise multiplication, which generates more expressive feature representations. Second, in the skip-connection module, we propose a dual-scale attention gate (DSAG), which highlights more valuable features and achieves more efficient skip connections. Finally, in the decoder, the final segmentation result is obtained by aggregating the feature information provided by each part, with decoding realized by up-sampling operations. Experiments on two public datasets, Luna and Isic2017, show that DAU-Net extracts feature information more efficiently with its different modules and achieves better segmentation results than classical segmentation models such as U-Net and U-Net++, verifying the effectiveness of the model.
Citation: Xiaoyan Zhang, Mengmeng He, Hongan Li. DAU-Net: A medical image segmentation network combining the Hadamard product and dual scale attention gate[J]. Mathematical Biosciences and Engineering, 2024, 21(2): 2753-2767. doi: 10.3934/mbe.2024122
Medical image segmentation, a key task in medical image processing, aims to accurately identify and separate tissue structures, organs, lesion regions, and other targets from medical images. With the continuous development of medical imaging technology, high-resolution modalities such as computed tomography (CT) [1,2] and ultrasonography [3] are widely used and provide physicians with valuable diagnostic and therapeutic tools. However, medical images often contain highly complex structures, including organs, blood vessels, tumors, various tissues, and lesion areas [4,5,6], and extracting useful information from them for accurate analysis is a complex and time-consuming task. Image quality and noise issues can further degrade segmentation accuracy, so more advanced algorithms and networks are required to capture the subtle features of these structures, which makes medical image segmentation even more challenging. To cope with these challenges, researchers continue to propose new methods and algorithms. From traditional image segmentation algorithms [7,8,9,10,11] to segmentation models based on convolutional neural networks [12,13,14], many innovative solutions have emerged in the field, and the rapid development of deep learning in particular has revolutionized medical image segmentation.
The emergence of convolutional neural networks (CNNs) and the introduction of fully convolutional networks (FCNs) freed image segmentation from traditional handcrafted feature engineering [15]: models learn and predict end-to-end at the pixel level, which greatly improves segmentation accuracy. However, these architectures face an important challenge in that critical detail information is often lost in the deep layers of the network. To cope with this problem, Ronneberger et al. first proposed the U-Net architecture [16], which employs an encoder-decoder design that allows the model to efficiently capture feature information at different scales and introduces skip connections in its symmetric structure, significantly improving performance in medical image segmentation. Subsequent studies were inspired by U-Net. Oktay et al. [17] proposed Attention U-Net, which introduces an attention mechanism into the U-Net architecture so that the model focuses more on the important regions in the image; this helps capture subtle image features and improves the accuracy and robustness of segmentation. Zhou et al. [18] proposed U-Net++, whose core idea is to construct a pyramidal feature extraction network; by introducing a multi-scale feature fusion mechanism that reduces the loss of semantic information, the model better understands the information in the image and its performance improves significantly. Huang et al. [19] proposed U-Net3+, an extension and improvement of the classical U-Net that designs a full-scale skip-connection scheme to combine low- and high-resolution information at different scales and further improve segmentation performance. He et al. [20] proposed ResNet, whose residual modules use shortcut connections to simplify the training of deep networks and solve the problem that increasing the number of layers makes a neural network difficult to train. Alom et al. [21] proposed the R2U-Net model by combining residual networks, U-Net, and recurrent convolutional neural networks (RCNNs); it alleviates gradient vanishing in deep networks and realizes the iterative transfer of information while preserving contextual information and local details, enabling R2U-Net to perform multiple recurrent operations at different scales and better understand the global context of an image. Chen et al. [22] proposed TransUNet, which for the first time bases such an architecture on the Transformer module, allowing the model to automatically capture global information and long-range dependencies in an image. In addition, TransUNet introduces skip connections and deep feature fusion to preserve local details and contextual information in images. This comprehensive architecture allows TransUNet to excel in medical image segmentation, especially for images with complex structures and rich textures. Ruan et al. [23] proposed EGE-UNet, which combines a Hadamard product attention module (GHPA) and a group aggregation bridge module (GAB) in a lightweight manner. GHPA groups input features and performs Hadamard product attention (HPA) on different axes to extract pathology information from different perspectives.
GAB effectively fuses multi-scale information by grouping low-level features, high-level features, and the masks generated by the decoder at each stage, while still providing high segmentation performance with only 50 KB of parameters. Valanarasu et al. [24] proposed UNeXt, which inherits the encoder-decoder structure and skip-connection design of U-Net, retaining the advantage of processing local features and global contextual information; one of its key innovations is the ability to process multimodal image data. UNeXt can process information from multiple modalities simultaneously and fuse them, fully exploiting the complementary information of different modalities to improve segmentation accuracy. This cross-modal fusion makes UNeXt suitable for a wider range of medical image segmentation tasks, capturing fine structures and textures in medical images while possessing strong generalization ability. Tang et al. [25] proposed CMUNeXt, a network that improves on the UNeXt model to enable fast and accurate assisted diagnosis in real scenarios. CMUNeXt utilizes a large-kernel and inverted-bottleneck design to thoroughly mix long-range spatial and positional information and efficiently extract global contextual information; in addition, it introduces a Skip-Fusion block that realizes smooth skip connections and ensures sufficient feature fusion. We also note PSNet, an attention-module-based dual encoder-decoder network for colonoscopic polyp segmentation proposed by Lewis et al. [26]. Specifically, this network consists of a dual encoder and decoder composed of a polyp segmentation (PS) encoder, a Transformer encoder, a PS decoder, an enhanced dilated Transformer decoder, a partial decoder, and a merge module. This dual encoder-decoder structure enables efficient feature extraction and exploitation, and the synchronized encoding and decoding operations help to better capture key features in the image. In addition, PSNet utilizes skip connections to preserve feature information at different levels, and attention mechanisms are incorporated into almost every level and module of the network to further increase its focus on important regions. This helps the network segment polyps in colonoscopy images more accurately.
In this paper, we propose DAU-Net, a medical image segmentation network that combines the Hadamard product and dual-scale attention gates. At the encoder head, we introduce a stem module to extract raw features from the input image; extracting enough feature information from the beginning compensates, to some extent, for the spatial information lost during subsequent feature extraction. In layers 1–4 of the network, we use depthwise separable convolution and pointwise convolution to extract spatial and channel information. In the convolution block of layer 5, we apply the Hadamard product to fuse feature maps of different levels or channels and thus cope with variation of the image at different scales. In the decoder stage, we combine a dual-scale attention gate (DSAG) with up-sampling operations; the DSAG is mainly composed of pointwise convolution and dilated convolution, where the pointwise convolution is mainly used to reduce the dimensionality of the features and the dilated convolution is used to expand the receptive field and capture more contextual information, so as to better capture the global and local structures in the image. In summary, the main contributions of this paper can be summarized as follows:
1) The DAU-Net model is proposed. It introduces a stem module at the head of the encoder to extract the original features of the image and prevent too much feature information from being lost through the top-level skip connection, and it applies the Hadamard product at the last layer of the encoder-decoder, which multiplies the feature maps element by element and thus generates more expressive feature representations; this helps improve the understanding of semantic information at different levels and enhances segmentation accuracy;
2) In the decoder stage, we combine the up-sampling operation with the proposed DSAG, a module mainly composed of pointwise convolution and dilated convolution. Each convolution is followed by an activation function and batch normalization, the two branches are connected for feature fusion and information transfer, and the more valuable features are finally screened out by a convolution with a voting mechanism, which improves the performance and applicability of the model;
3) Experiments are conducted on two different datasets, lung CT images and skin lesion segmentation. The results show that the DAU-Net model proposed in this paper outperforms previous SOTA segmentation models in terms of IoU and Dice coefficients and achieves better segmentation performance and higher accuracy.
The rest of the paper is organized as follows: Section 2 describes the overall architecture of the network and the details of our proposed method. Section 3 presents the experimental design and results, compares the proposed network with other classical networks, and verifies the usefulness of the key modules in DAU-Net. Section 4 discusses several state-of-the-art attention-based networks with lightweight architectures in the segmentation domain. Finally, the paper is summarized in Section 5.
The network architecture of our proposed DAU-Net is shown in Figure 1. It has an encoder-decoder structure similar to U-Net, consisting of five layers from top to bottom and divided into an encoder stage and a decoder stage connected by skip connections. In the encoder stage, semantic feature information of the medical image is extracted by a convolutional neural network, and the number of channels is expanded with convolution blocks. In the decoder stage, the features extracted by the dual-scale attention gate are fused with the up-sampled decoder features, so that the global semantic information of the medical image is captured and the image is accurately segmented.
The encoder has five layers from top to bottom. The first four layers include depthwise separable convolution, pointwise convolution, the GELU activation function, batch normalization, and down-sampling operations. At the fifth layer, we introduce the Hadamard product into the convolution block. In addition, we include a stem block after the input to extract the original features of the input image: to avoid losing important semantic feature information through the inconsistent skip connection at the top layer, we set up two ordinary convolution blocks, each consisting of a convolutional layer with a 3 × 3 kernel, a stride of 1, and a padding of 1, followed by a batch normalization layer and a ReLU activation function.
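As a concrete illustration, the following PyTorch sketch shows such a stem block; the input and output channel counts are illustrative assumptions rather than the exact configuration used in the paper.

```python
import torch
import torch.nn as nn

class StemBlock(nn.Module):
    """Two ordinary 3x3 conv blocks (stride 1, padding 1), each followed by
    BN + ReLU, used to extract raw features from the input image.
    Channel sizes are illustrative assumptions."""
    def __init__(self, in_ch: int = 3, out_ch: int = 16):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=1, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, kernel_size=3, stride=1, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.stem(x)
```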
In the encoder, the main components of the convolution block are depthwise separable convolution and pointwise convolution. Compared with ordinary convolution, depthwise separable convolution effectively reduces memory requirements and the computational cost of convolution operations. We use a depthwise convolution with a larger kernel size to extract the global information of each channel and then apply a residual connection. To fully fuse spatial and channel information, we apply two pointwise convolutions after the depthwise separable convolution. We set the hidden dimension between the two pointwise convolution layers to four times the input width; by expanding the hidden dimension, we can more fully and comprehensively blend the global spatial information extracted by the depthwise convolution. In addition, each convolution is followed by the GELU activation function and a batch normalization layer. The encoder convolution block can be defined as
$$f'_t = \mathrm{BN}\big(\sigma_1\{\mathrm{DepthwiseConv}(f_{t-1})\}\big) + f_{t-1} \tag{1}$$
$$f''_t = \mathrm{BN}\big(\sigma_1\{\mathrm{PointwiseConv}(f'_t)\}\big) \tag{2}$$
$$f_t = \mathrm{BN}\big(\sigma_1\{\mathrm{PointwiseConv}(f''_t)\}\big) \tag{3}$$
where $f_t$ denotes the output feature map of layer $t$ (the input of the first layer comes from the stem block), $\sigma_1$ denotes the GELU activation function, and BN denotes the batch normalization layer.
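The following PyTorch sketch illustrates Eqs (1)–(3); the depthwise kernel size is an assumption (the text only states that a larger kernel is used), while the fourfold hidden dimension follows the description above.

```python
import torch
import torch.nn as nn

class EncoderConvBlock(nn.Module):
    """Sketch of Eqs (1)-(3): depthwise convolution with a residual connection,
    followed by two pointwise convolutions whose hidden width is four times the
    input width; each convolution is followed by GELU + BN."""
    def __init__(self, channels: int, kernel_size: int = 7):  # kernel size assumed
        super().__init__()
        self.depthwise = nn.Conv2d(channels, channels, kernel_size,
                                   padding=kernel_size // 2, groups=channels)
        self.bn1 = nn.BatchNorm2d(channels)
        self.pw1 = nn.Conv2d(channels, channels * 4, kernel_size=1)
        self.bn2 = nn.BatchNorm2d(channels * 4)
        self.pw2 = nn.Conv2d(channels * 4, channels, kernel_size=1)
        self.bn3 = nn.BatchNorm2d(channels)
        self.act = nn.GELU()

    def forward(self, f_prev: torch.Tensor) -> torch.Tensor:
        f1 = self.bn1(self.act(self.depthwise(f_prev))) + f_prev  # Eq (1)
        f2 = self.bn2(self.act(self.pw1(f1)))                     # Eq (2)
        f3 = self.bn3(self.act(self.pw2(f2)))                     # Eq (3)
        return f3
```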
In deep convolutional neural networks, the Hadamard product can fuse feature maps of different levels or channels. By multiplying the corresponding elements of two feature maps A and B with the same dimensions, element-by-element fusion combines the related information of the two feature maps. We use this operation in the fifth layer of the encoder and decoder structure to generate feature representations with more representational power, which helps improve the understanding of semantic information at different levels and increases segmentation accuracy. The mathematical expression of the Hadamard product is
$$C = A \odot B \tag{4}$$
where $C$ is a new tensor with the same shape as $A$ and $B$. Each element of $C$ is the product of the elements at the corresponding positions of $A$ and $B$, i.e., $C_{ij} = A_{ij} \cdot B_{ij}$.
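As a small worked example, for two 2 × 2 matrices the Hadamard product is computed entry by entry:

$$\begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} \odot \begin{pmatrix} 5 & 6 \\ 7 & 8 \end{pmatrix} = \begin{pmatrix} 1\cdot 5 & 2\cdot 6 \\ 3\cdot 7 & 4\cdot 8 \end{pmatrix} = \begin{pmatrix} 5 & 12 \\ 21 & 32 \end{pmatrix}.$$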
In this convolution block, given an input $x$ and a randomly initialized learnable tensor $p$, bilinear interpolation is used to adjust the size of $p$ to match the size of $x$. We then apply a depthwise separable convolution to $p$, followed by a Hadamard product between $x$ and $p$ to obtain the output. Together with the proposed dual-scale attention gate, applying the Hadamard product to features from different scales or resolutions improves the robustness of the model and helps to deal with the variation of objects in the image at different scales.
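A minimal PyTorch sketch of this layer-5 block is given below; the initial spatial size of the learnable tensor p and the depthwise kernel size are assumptions, since they are not stated in the text.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HadamardProductBlock(nn.Module):
    """Sketch of the fifth-layer block: a randomly initialized learnable tensor p
    is resized to the input via bilinear interpolation, passed through a depthwise
    convolution, and multiplied element-wise (Hadamard product) with the input."""
    def __init__(self, channels: int, init_size: int = 8, kernel_size: int = 3):
        super().__init__()
        self.p = nn.Parameter(torch.randn(1, channels, init_size, init_size))
        self.depthwise = nn.Conv2d(channels, channels, kernel_size,
                                   padding=kernel_size // 2, groups=channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Resize the learnable tensor to match the spatial size of x.
        p = F.interpolate(self.p, size=x.shape[-2:], mode="bilinear",
                          align_corners=False)
        p = self.depthwise(p)
        return x * p  # Hadamard (element-wise) product
```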
The decoder stage also consists of five layers from top to bottom. Each layer consists of a skip-connection module and an up-sampling block, connected by the dual-scale attention gate (DSAG) proposed in this paper, as shown in Figure 2. In the DSAG, to adaptively select semantic features of different resolutions, we use a pointwise convolution and a dilated convolution to extract semantic features with different receptive fields: the pointwise convolution is mainly used to reduce the dimensionality of the features, and the dilated convolution is used to expand the receptive field and capture more contextual information. Each convolution is equipped with a GELU function and a batch normalization layer for feature fusion and information transfer, and the two convolutions produce feature maps of the same size. The output feature maps are then concatenated, the most valuable features are selected by a convolution block with a voting mechanism, and a final Sigmoid activation function constrains the output to the range 0–1. The module can be defined as
$$f_{\mathrm{Concat}} = \mathrm{Concat}\big(\mathrm{BN}\{\sigma_2\{\mathrm{PointwiseConv}(f)\}\},\ \mathrm{BN}\{\sigma_2\{\mathrm{DilationConv}(f)\}\}\big) \tag{5}$$
$$f_m = f \times \sigma_3\big(\mathrm{VoteConv}(f_{\mathrm{Concat}})\big) + f \tag{6}$$
where $f_{\mathrm{Concat}}$ denotes the concatenated features, $f_m$ is the output feature of the DSAG, $f$ represents the encoded features, and $\sigma_2$ and $\sigma_3$ denote the GELU and Sigmoid activations, respectively.
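The sketch below mirrors Eqs (5) and (6) in PyTorch; the reduced branch width, the dilation rate, and the exact form of the voting convolution are assumptions, as they are not specified in the text.

```python
import torch
import torch.nn as nn

class DSAG(nn.Module):
    """Sketch of the dual-scale attention gate, Eqs (5)-(6): a pointwise branch and
    a dilated-convolution branch (each conv -> GELU -> BN) are concatenated, passed
    through a 'voting' convolution and a Sigmoid, and used to re-weight the features."""
    def __init__(self, channels: int, dilation: int = 2):
        super().__init__()
        mid = channels // 2  # assumed reduced width of each branch
        self.point = nn.Sequential(
            nn.Conv2d(channels, mid, kernel_size=1),
            nn.GELU(), nn.BatchNorm2d(mid))
        self.dilated = nn.Sequential(
            nn.Conv2d(channels, mid, kernel_size=3, padding=dilation, dilation=dilation),
            nn.GELU(), nn.BatchNorm2d(mid))
        self.vote = nn.Conv2d(2 * mid, channels, kernel_size=1)  # voting conv (assumed 1x1)
        self.gate = nn.Sigmoid()

    def forward(self, f: torch.Tensor) -> torch.Tensor:
        f_concat = torch.cat([self.point(f), self.dilated(f)], dim=1)  # Eq (5)
        return f * self.gate(self.vote(f_concat)) + f                  # Eq (6)
```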
The decoder is likewise composed of five layers from bottom to top, each including a skip connection and an up-sampling module. Most traditional skip connections use ordinary convolution operations for feature fusion. In our fusion module, we use a group convolution with 2 groups to separately extract the features obtained from the skip connection and from the up-sampling operation; its kernel size is 3 × 3, with a stride of 1 and a padding of 1. To fuse the extracted features sufficiently, we append two pointwise convolutions after the group convolution, which adaptively blend the grouped features and perform extensive feature fusion. Each convolution is followed by a GELU activation function and a BatchNorm layer, as defined below:
$$f_n = \mathrm{Concat}\big(\mathrm{BN}\{\mathrm{OrdinaryConv}(f_\varepsilon)\},\ \mathrm{BN}\{\mathrm{OrdinaryConv}(f_\eta)\}\big) \tag{7}$$
$$f_x = \mathrm{BN}\big(\sigma_1\{\mathrm{PointwiseConv}(f_n)\}\big) \tag{8}$$
$$f'_x = \mathrm{BN}\big(\sigma_1\{\mathrm{PointwiseConv}(f_x)\}\big) \tag{9}$$
Here, $f'_x$ denotes the fused output feature map of the decoder, while $f_\varepsilon$ and $f_\eta$ denote the features obtained from the skip connection and the up-sampling operation, respectively. The purpose of this series of operations is to fuse the extracted features into an information-rich feature representation that supports better performance of the model on the task.
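For illustration, the following PyTorch sketch follows Eqs (7)–(9); Eq (7) is written here with two separate 3 × 3 convolutions rather than a single 2-group convolution, and the channel handling is an assumption.

```python
import torch
import torch.nn as nn

class DecoderFusionBlock(nn.Module):
    """Sketch of Eqs (7)-(9): the skip-connection features f_eps and the upsampled
    features f_eta each pass through a 3x3 convolution (stride 1, padding 1) with BN,
    are concatenated, and are then fused by two pointwise convolutions with GELU + BN."""
    def __init__(self, channels: int):
        super().__init__()
        self.conv_skip = nn.Sequential(
            nn.Conv2d(channels, channels, 3, stride=1, padding=1),
            nn.BatchNorm2d(channels))
        self.conv_up = nn.Sequential(
            nn.Conv2d(channels, channels, 3, stride=1, padding=1),
            nn.BatchNorm2d(channels))
        self.pw1 = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.GELU(), nn.BatchNorm2d(channels))
        self.pw2 = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.GELU(), nn.BatchNorm2d(channels))

    def forward(self, f_eps: torch.Tensor, f_eta: torch.Tensor) -> torch.Tensor:
        f_n = torch.cat([self.conv_skip(f_eps), self.conv_up(f_eta)], dim=1)  # Eq (7)
        f_x = self.pw1(f_n)                                                   # Eq (8)
        return self.pw2(f_x)                                                  # Eq (9)
```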
The programming language used in this experiment is Python 3.8, the deep learning framework is PyTorch 1.7, and the CUDA version is 11.1. The processor is an Intel(R) Xeon(R) Platinum 8255C, the graphics card is an NVIDIA RTX 2080Ti discrete graphics card, and 40 GB of RAM is available; training and testing are carried out on a Linux operating system. During training, we use the binary cross-entropy loss (BCELoss), the batch size is set to 4, and the number of epochs is set to 200.
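For reference, a minimal training loop with these settings might look as follows; the optimizer and learning rate are assumptions, not values stated in the paper, and the model and data loader are placeholders.

```python
import torch
import torch.nn as nn

def train(model: nn.Module, loader, device: str = "cuda") -> None:
    """Illustrative training loop with the reported settings (BCELoss, batch size 4
    via the data loader, 200 epochs). Optimizer and learning rate are assumed."""
    criterion = nn.BCELoss()  # model is assumed to end with a Sigmoid, outputs in [0, 1]
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # assumed optimizer/lr
    model.to(device).train()
    for epoch in range(200):
        for images, masks in loader:  # loader expected to be built with batch_size=4
            images, masks = images.to(device), masks.to(device)
            optimizer.zero_grad()
            loss = criterion(model(images), masks)
            loss.backward()
            optimizer.step()
```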
To verify the effectiveness of the model, this paper uses two public datasets of different types from the medical field, covering segmentation tasks in different modalities: lung segmentation in CT images and skin lesion segmentation. The Luna lung segmentation dataset comes from the 2017 Kaggle Lung Nodule Analysis competition and contains a total of 267 CT images, of which 213 are used as the training set and 54 as the test set; the validation set is the same as the test set. The Isic2017 skin lesion segmentation dataset is provided by the International Skin Imaging Collaboration (ISIC) and comprises 2750 images in three subsets: a training set (2000 images), a validation set (150 images), and a test set (600 images). Ground-truth masks in these datasets were generated using various techniques, and all data were reviewed and organized by practicing dermatologists with expertise in dermoscopy, so they are accurate and reliable. The data split for the training, validation, and test sets is shown in Table 1.
Dataset | Train | Valid | Test | Total |
Luna | 213 | - | 54 | 267 |
Isic2017 | 2000 | 150 | 600 | 2750 |
The evaluation metrics in this paper mainly consist of the intersection over union (IoU), the Dice similarity coefficient, the Hausdorff distance (Hd), and the binary cross-entropy loss (BCELoss), which together comprehensively evaluate the performance of different models. We also report the processing time of each network and the number of learnable parameters to show the performance of each network comprehensively.
The IoU and Dice coefficient are evaluation metrics commonly used in medical image segmentation; they account for the degree of spatial overlap between the model prediction and the actual annotation and are well suited to the pixel-level matching required in segmentation tasks. In this paper, the IoU denotes the degree of area similarity between the segmented object and the original object, and the Dice coefficient measures the similarity of the two sets; both take values in the range [0, 1]. In Eqs (10) and (11), A is the prediction result and B is the ground-truth label; the larger the value of the metric, the higher the similarity with the actual result and the better the segmentation.
The Hd measures the similarity between two point sets: it is the maximum, over all points, of the shortest distance between the segmentation result and the annotation, and thus quantifies the largest degree of mismatch between the two. In Eq (12), h(A, B) and h(B, A) are the one-way Hausdorff distances from set A to set B and from set B to set A, respectively. h(A, B) computes, for each point $a_i$ in set A, the distance to its nearest point $b_j$ in set B, and then takes the maximum of these distances; the smaller the value, the closer the segmentation result is to the annotation. BCELoss is effective for pixel-wise binary classification in image segmentation. In Eq (13), N is the total number of medical image samples, $y_i$ is the category of the i-th sample, and $p_i$ is the predicted value of the i-th sample; the smaller the loss, the closer the model's segmentation result is to the true annotation.
$$\mathrm{IoU}(A,B) = \frac{|A \cap B|}{|A \cup B|} \tag{10}$$
$$\mathrm{Dice} = \frac{2|A \cap B|}{|A| + |B|} \tag{11}$$
$$H(A,B) = \max\big(h(A,B),\ h(B,A)\big) \tag{12}$$
$$\mathrm{BCELoss} = -\frac{1}{N}\sum_{i=1}^{N}\big[y_i \log(p_i) + (1-y_i)\log(1-p_i)\big] \tag{13}$$
where $h(A,B)=\max_{a\in A}\{\min_{b\in B}\|a-b\|\}$ and $h(B,A)=\max_{b\in B}\{\min_{a\in A}\|b-a\|\}$ are the one-way Hausdorff distances from set A to set B and from set B to set A, respectively, and $\|a-b\|$ denotes the distance between points $a$ and $b$.
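For reference, the following Python functions implement the standard forms of Eqs (10)–(12) for binary masks and point sets; this is a generic implementation, not the authors' evaluation code.

```python
import numpy as np
from scipy.spatial.distance import directed_hausdorff

def iou_dice(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7):
    """Compute IoU (Eq 10) and Dice (Eq 11) for binary masks of the same shape."""
    pred, target = pred.astype(bool), target.astype(bool)
    inter = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    iou = inter / (union + eps)
    dice = 2 * inter / (pred.sum() + target.sum() + eps)
    return iou, dice

def hausdorff(pred_points: np.ndarray, target_points: np.ndarray) -> float:
    """Symmetric Hausdorff distance (Eq 12) between two point sets of shape (N, 2)."""
    return max(directed_hausdorff(pred_points, target_points)[0],
               directed_hausdorff(target_points, pred_points)[0])
```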
To verify the performance advantages of the medical image segmentation model proposed in this paper, we compare it with several classical medical image segmentation models, including U-Net, U-Net++, Attention U-Net, Ce-Net, UNeXt, and CMUNeXt. Table 2 provides the performance comparison of the different models on the Luna lung CT dataset and the Isic2017 skin lesion dataset.
Networks | Params (M) | Luna | Isic2017 | ||||||||
Iou↑ | Dice↑ | hd↓ | Loss↓ | Time | Iou↑ | Dice↑ | hd↓ | Loss↓ | Time | ||
U-Net | 34.52 | 92.89 | 95.96 | 6.294 | 1.359 | 12.69 | 79.29 | 83.95 | 6.709 | 2.944 | 37.34 |
U-Net++ | 26.90 | 93.38 | 96.24 | 6.258 | 0.935 | 11.84 | 80.31 | 87.49 | 4.705 | 3.274 | 36.68 |
Attention-UNet | 34.87 | 93.67 | 96.75 | 6.208 | 1.283 | 12.47 | 81.72 | 88.99 | 5.746 | 2.744 | 37.49 |
Ce-Net | 18.63 | 93.64 | 96.72 | 6.209 | 0.531 | 11.08 | 81.05 | 88.63 | 5.708 | 1.835 | 32.25 |
UNeXt | 1.47 | 94.26 | 96.80 | 6.132 | 0.801 | 9.65 | 82.66 | 88.85 | 4.733 | 1.767 | 27.21 |
CMUNeXt | 3.14 | 94.91 | 97.03 | 6.178 | 0.822 | 10.71 | 83.05 | 89.01 | 4.686 | 1.046 | 28.09 |
DAU-Net | 3.50 | 95.46 | 97.36 | 5.874 | 0.463 | 10.28 | 83.36 | 89.22 | 4.657 | 0.730 | 29.42 |
*Note: Time is the network processing time in h, Params(M) is the number of learnable parameters of the network. |
From the experimental results in Table 2, it can be seen that the proposed DAU-Net model achieves better results than the other models in the four metrics of IoU, Dice, Hd, and loss on both the Luna and Isic2017 datasets. In terms of learnable parameters, however, the UNeXt network has fewer parameters and therefore a shorter runtime. In the base network U-Net, the convolution operation is a local operation, which lacks a larger receptive field and cannot fully exploit enough contextual information, so semantic information is lost; in the subsequent improved networks, the results increase significantly. In contrast, our model shows better performance on the Luna dataset: DAU-Net improves the IoU and Dice by 0.55 and 0.33 percentage points, respectively, compared with the CMUNeXt network, while reducing the loss by 0.359. For the Isic2017 dataset, our proposed model also performs better, but since the Isic2017 dataset is large, the running time is relatively long. Figure 3 shows some segmentation results.
In order to verify the validity of multiple modules of the experimental model in this paper, ablation experiments are conducted on the stem, the Hadamard product, and dual-scale attention gate modules based on the U-Net network. The ablation experiments are conducted on the Luna dataset and the results are shown in Table 3.
Model | Iou↑ | Dice↑ | hd↓ | Loss↓ |
U-Net | 0.9289 | 0.9596 | 6.2947 | 1.3591 |
U-Net+stem | 0.9357 | 0.9643 | 6.2140 | 1.3095 |
U-Net+hp | 0.9346 | 0.9625 | 6.2142 | 1.3142 |
U-Net+DSAG | 0.9399 | 0.9683 | 6.0479 | 1.2947 |
U-Net+hp+DSAG | 0.9468 | 0.9698 | 6.1943 | 1.0439 |
U-Net+stem+DSAG | 0.9519 | 0.9716 | 5.9837 | 0.8046 |
U-Net+stem+hp+DSAG | 0.9546 | 0.9736 | 5.8735 | 0.4634 |
*Note: stem denotes encoder head module; hp denotes The Hadamard product; DSAG denotes Dual Scale Attention Gate. |
From the ablation results, it can be seen that when only the stem or hp module is added to the U-Net network, the performance improves only slightly, because the extracted feature information is not fully utilized to enhance the network's feature extraction. When the DSAG module is added, the network enhances the extracted features while expanding the receptive field to filter out the more valuable ones; the model improves the IoU by 1.1 percentage points compared with U-Net, indicating that this module has a positive effect on the network. When all the modules are added together, the stem reduces the loss of input-image feature information, hp generates more expressive feature representations that improve the understanding of semantic information at different levels, and DSAG fuses features with contextual information from different scales and exploits the attention mechanism to enhance the target features once enough features have been extracted. The experimental results show that the IoU and Dice coefficients improve by 2.57 and 1.4 percentage points, respectively, compared with the U-Net network, which demonstrates the better segmentation effect and performance of our proposed model.
To validate the performance of our proposed network, we have extensively studied existing SOTA methods. In particular, we scrutinized the PSNet network, which, as reported in [26], is based on a dual encoder-decoder architecture in which the attention mechanism is incorporated into almost every module and level of the network through skip connections between the PS encoder and decoder, allowing the network to better capture the important features in the image. The difference from the model proposed in this paper is that DAU-Net utilizes a dual-scale attention gate, which improves attention to global and local structures through a combination of pointwise convolution and dilated convolution. From the experimental results, although our model shows better performance, it still needs further enhancement at the edges and boundaries of the image, whereas PSNet uses a Transformer decoder as well as attention mechanisms in the skip connections to enhance attention to critical regions. PSNet mainly focuses on deep polyp segmentation of colonoscopy images; it can identify the location of polyps at the pixel level and distinguish them from healthy tissue, and is suited to the field of endoscopy. In an extensive study on five publicly available polyp datasets, PSNet outperforms the current state-of-the-art results, with performances of 0.863 and 0.797 in terms of mDice and mIoU, respectively. Although these two approaches differ in their target domains, they share a common focus on key issues in image segmentation, which also inspires us to continue our research on image segmentation problems in the future.
A series of networks with superior performance have also emerged in the field of damage segmentation, including but not limited to Attention U-Net, CrackSegNet, DeepLab V3+, FPHBN, and U-Net; among them, Attention U-Net, DeepLab V3+, and U-Net are comparison models in this paper. DeepLab V3+ [27] is a deep learning model for semantic image segmentation whose main body is a DCNN with atrous (dilated) convolution followed by an atrous spatial pyramid pooling module; its main purpose is to introduce multi-scale information so that the model can capture context from a variety of receptive fields. CrackSegNet [28] is a concrete crack segmentation network based on convolutional neural networks, which includes a backbone network, spatial pyramid pooling, and a skip-connection module. FPHBN [29] is also a deep learning-based road crack detection method, which mainly integrates contextual information into low-level features in a feature pyramid fashion for crack detection. Although CrackSegNet and FPHBN are used for automatic crack segmentation tasks, their network structures and feature extraction strategies share some similarities with medical image segmentation. We can gain insight into their generalization and flexibility in different domains and consider whether they can also be useful in medical image segmentation tasks.
At the same time, we acknowledge that advanced attention-module-based networks have developed lightweight architectures with state-of-the-art performance. For example, [30] presents an internal damage segmentation network (IDSNet) for active thermography that uses an attention-based generative adversarial network (AGAN) to generate synthetic images. IDSNet exhibits very high real-time processing capability, its lightweight architecture has only 0.085 M learnable parameters, and it achieves significant performance gains on the internal damage segmentation task. [31] proposes a novel semantic transformer representation network (STRNet) focusing on real-time crack segmentation at the pixel level. The network mainly consists of a squeeze-and-excitation attention-based encoder and a multi-head attention-based decoder, and it achieves efficient and accurate crack segmentation while maintaining a fast processing speed. By combining a lightweight structure with optimized loss functions, STRNet reaches a processing speed of 49.2 frames per second and shows the best performance in the evaluation metrics compared with other state-of-the-art networks. The discussion and study of these lightweight, state-of-the-art architectures also give us important ideas for our subsequent research work.
In our future research, we will compare in detail the similarities and differences between our work and these state-of-the-art models, especially in terms of lightweight architecture, performance, and applicable scenarios. We will endeavor to discuss these related works thoroughly in order to design networks with better performance.
In this paper, we propose the DSAG module, which combines pointwise convolution and dilated convolution to increase the receptive field without enlarging the feature dimensions, so that it captures more contextual information and fuses feature maps to improve the representational ability of the features. In addition, a stem block and the Hadamard product are introduced into the encoder-decoder in combination with convolution to avoid losing feature information while generating more expressive feature representations, which helps to deal with image variation at different scales. On this basis, we propose DAU-Net and show through experiments that our model can efficiently extract feature information and more accurately localize and segment structures and lesion regions in medical images, showing better performance and applicability to different scenarios. However, despite the success of our model, the results show that there is still room for improvement in edge and boundary detection; in particular, for some subtle structures in the edge regions of the image, the model's segmentation can be fuzzy or inaccurate. In subsequent research, we will therefore further optimize the DAU-Net model for edges and boundaries at different scales, so as to give it a certain degree of continuity and stability and to prevent broken or unnatural boundaries in the segmentation results, ultimately making it adaptable to more medical segmentation tasks.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
This work was partly supported by the Natural Science Basis Research Plan in Shaanxi Province of China under Grant 2023-JC-YB-517, and the Open Project Program of State Key Laboratory of Virtual Reality Technology and Systems Beihang University under Grant VRLAB2023B08.
The authors declare there is no conflict of interest.
[1] H. Wang, B. Ji, G. He, W. Yu, A computed tomography image segmentation algorithm for improving the diagnostic accuracy of rectal cancer based on U-net and residual block, J. Biomed. Eng., 39 (2022), 166–174. https://doi.org/10.7507/1001-5515.201910027
[2] M. Yue, Q. Wei, W. Deng, T. Wang, Y. Deng, B. Huang, A review of automatic liver tumor segmentation based on computed tomography, J. Biomed. Eng., 35 (2018), 481–487. https://doi.org/10.7507/1001-5515.201708009
[3] G. Dai, H. Sun, O. Yang, Research on image resolution problem in ultrasonic imaging detection, Comput. Knowl. Technol., 6 (2010), 5937–5939. https://doi.org/10.3969/j.issn.1009-3044.2010.21.115
[4] W. M. Salama, A. Shokry, A novel framework for brain tumor detection based on convolutional variational generative models, Multim. Tools Appl., 81 (2022), 16441–16454. https://doi.org/10.1007/s11042-022-12362-9
[5] S. Sultana, A. Robinson, D. Y. Song, J. Lee, Automatic multi-organ segmentation in computed tomography images using hierarchical convolutional neural network, J. Med. Imaging, 7 (2020), 055001. https://doi.org/10.1117/1.JMI.7.5.055001
[6] V. Venugopal, J. Joseph, M. V. Das, M. K. Nath, DTP-Net: A convolutional neural network model to predict threshold for localizing the lesions on dermatological macro-images, Comput. Biol. Med., 148 (2022), 105852. https://doi.org/10.1016/j.compbiomed.2022.105852
[7] J. H. Pujar, P. S. Gurjal, K. S. Kunnur, Medical image segmentation based on vigorous smoothing and edge detection ideology, Int. J. Electr. Comput. Eng., 4 (2020), 1143–1149.
[8] K. Bhargavi, S. Jyothi, A survey on threshold based segmentation technique in image processing, Int. J. Innov. Res. Dev., 3 (2014), 234–239.
[9] T. A. Jemimma, Y. J. Vetharaj, Watershed algorithm based DAPP features for brain tumor segmentation and classification, in 2018 International Conference on Smart Systems and Inventive Technology (ICSSIT), (2018), 155–158. https://doi.org/10.1109/ICSSIT.2018.8748436
[10] S. Madhukumar, N. Santhiyakumari, Evaluation of k-Means and fuzzy C-means segmentation on MR images of brain, Egyptian J. Radiol. Nuclear Med., 46 (2015), 475–479. https://doi.org/10.1016/j.ejrnm.2015.02.008
[11] Z. Zheng, X. Zhang, S. Zheng, Y. Shi, CT liver image segmentation based on region growing and uniformized level set, J. Zhejiang Univ., 52 (2018), 2382–2396.
[12] B. Qian, Z. Xiao, W. Song, Improved convolutional neural network for segmentation on lung images, Comput. Sci. Explor., 14 (2020), 1358–1367.
[13] H. Li, Q. Zheng, W. Yan, R. Tao, X. Qi, Z. Wen, Image super-resolution reconstruction for secure data transmission in Internet of Things environment, Math. Biosci. Eng., 18 (2021), 6652–6671. https://doi.org/10.3934/mbe.2021330
[14] A. Rehman, M. Harouni, F. Zogh, T. Saba, M. Karimi, G. Jeon, Detection of lung tumors in CT scan images using convolutional neural networks, IEEE/ACM Trans. Comput. Biol. Bioinform., (2023). https://doi.org/10.1109/TCBB.2023.3315303
[15] J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2015), 3431–3440. https://doi.org/10.1109/CVPR.2015.7298965
[16] O. Ronneberger, P. Fischer, T. Brox, U-Net: Convolutional networks for biomedical image segmentation, in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, (2015), 234–241.
[17] O. Oktay, J. Schlemper, L. Le Folgoc, M. Lee, M. Heinrich, K. Misawa, et al., Attention U-Net: Learning where to look for the pancreas, preprint, arXiv:1804.03999.
[18] Z. Zhou, M. M. R. Siddiquee, N. Tajbakhsh, J. Liang, UNet++: Redesigning skip connections to exploit multiscale features in image segmentation, IEEE Trans. Med. Imaging, 39 (2019), 1856–1867. https://doi.org/10.1109/TMI.2019.2959609
[19] H. Huang, L. Lin, R. Tong, H. Hu, Q. Zhang, Y. Iwamoto, et al., UNet 3+: A full-scale connected UNet for medical image segmentation, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (2020), 1055–1059. https://doi.org/10.1109/ICASSP40776.2020.9053405
[20] K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2016), 770–778.
[21] Z. Alom, M. Hasan, C. Yakopcic, T. M. Taha, V. K. Asari, Recurrent residual convolutional neural network based on U-Net (R2U-Net) for medical image segmentation, preprint, arXiv:1802.06955.
[22] J. Chen, Y. Lu, Q. Yu, X. Luo, E. Adeli, Y. Wang, et al., TransUNet: Transformers make strong encoders for medical image segmentation, preprint, arXiv:2102.04306.
[23] J. Ruan, M. Xie, J. Gao, T. Liu, Y. Fu, EGE-UNet: An efficient group enhanced UNet for skin lesion segmentation, in International Conference on Medical Image Computing and Computer-Assisted Intervention, (2023), 481–490. https://doi.org/10.1007/978-3-031-43901-8_46
[24] J. Valanarasu, V. M. Patel, UNeXt: MLP-based rapid medical image segmentation network, in International Conference on Medical Image Computing and Computer-Assisted Intervention, (2022), 23–33. https://doi.org/10.1007/978-3-031-16443-9_3
[25] F. Tang, J. Ding, L. Wang, C. Ning, S. K. Zhou, CMUNeXt: An efficient medical image segmentation network based on large kernel and skip fusion, preprint, arXiv:2308.01239.
[26] J. Lewis, Y. J. Cha, J. Kim, Dual encoder–decoder-based deep polyp segmentation network for colonoscopy images, Sci. Rep., 13 (2023), 1183. https://doi.org/10.1038/s41598-023-28530-2
[27] L. C. Chen, Y. Zhu, G. Papandreou, F. Schroff, H. Adam, Encoder-decoder with atrous separable convolution for semantic image segmentation, in Proceedings of the European Conference on Computer Vision (ECCV), 2018.
[28] Y. Ren, J. Huang, Z. Hong, W. Lu, J. Yin, L. Zou, et al., Image-based concrete crack detection in tunnels using deep fully convolutional networks, Constr. Build. Mater., 234 (2020), 117367. https://doi.org/10.1016/j.conbuildmat.2019.117367
[29] F. Yang, L. Zhang, S. Yu, D. Prokhorov, X. Mei, H. Ling, Feature pyramid and hierarchical boosting network for pavement crack detection, IEEE Trans. Intell. Transport. Syst., 21 (2020), 1525–1535. https://doi.org/10.1109/TITS.2019.2910595
[30] R. Ali, Y. Cha, Attention-based generative adversarial network with internal damage segmentation using thermography, Autom. Constr., 141 (2022), 104412. https://doi.org/10.1016/j.autcon.2022.104412
[31] D. Kang, Y. Cha, Efficient attention-based deep encoder and decoder for automatic crack segmentation, Struct. Health Monit., 21 (2022), 2190–2205. https://doi.org/10.1177/14759217211053776