非对称的编码器-解码器架构下图像分割方法研究

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

中图分类号：TP391.41 文献标志码：A文章编号：1006-0316（2025）06-0001-08

doi：10.3969/j.issn.1006-0316.2025.06.001

Abstract：Traditional image segmentation techniques often rely on deep learning models based on Convolutional Neural Networks and Transformer architectures.Although these models excel at local feature extraction，they have limitations in capturing long-range dependencies.Moreover，such models tend to have a large number of parameters and high computational complexity，which results in significant computational burdens，especially in resource-constrained environments.To address this issue，this paper proposesa lightweight image segmentation method based on Mamba.By introducing Mamba’s eficient architecture combined with the classical U-Net structure， this method aims to tackle the challenges faced by image segmentation models in mobile device scenarios， such as large parameter sizes and inadequate processing speeds for real-time applications.Specificaly，the method incorporates Visual State Space （VSS） blocks，which are used alongside convolutions （CNN） to form hybrid building blocks for capturing extensive contextual information.Additionally，a non-symmetric encoder-decoder structure is designed.Experimentsonthe public dataset ISIC20l7 show that， while maintaining segmentation accuracy，the proposed model reduces the parameter count by 99.94% compared to traditional pure visual Mamba models， by 75.51% compared to the lightest existing visual Mamba U-Net model， and by 99.84% compared to the classic U-Net model. The designed model achieves significant reductions in computational complexity while maintaining excellent segmentation accuracy， thus meeting the demands ofreal-time applications.

Key words ∵ Mamba; image segmentation;lightweight mode;encoder;decoder

随着深度学习在计算机视觉领域的广泛应用，图像分割技术在医疗图像分析、自动驾驶、遥感影像处理等领域得到了越来越多的关注和应用。（剩余8861字）

试读结束

购买全文6.00元下一篇基于Archard理论的航空发动机零件磨损研究

机械

2025年06期

¥6.00/本