基于多模态退化特征学习的水下图像增强

  • 打印
  • 收藏
收藏成功


打开文本图片集

中图分类号:TP391文献标志码:A

Multi-modal Degradation Feature Learning for Underwater Image Enhancement

XIONG Qingbo 1 ,CHEN Lei 1 , LIANG Xiaoli 1 , LIU Tianxu²

(1. School of Software,Henan University,Kaifeng 45OO46,Henan,China; 2.Henan Provincial Transportation Dispatching Command Center,Zhengzhou 45Oo16,Henan,China)

Abstract:Toaddress the lack of generalizationand flexibilityin traditional underwater image enhancementmodels,a multi-modal degraded contrastive language-image pre-training(MD-CLIP)model was proposed.MD-CLIP model was trainedusingcontrastive learning toencodetheimage featuresand textfeaturesof low-qualityunderwaterimages into multi-modaldegraded features.Across-atentionmechanismand prompt embedding wereused to integrate themultimodal degraded featurespredictedbyMD-CLIP modelintotheunderwaterimageenhancementmodel,adjustingthe model's performance and generalization.Ablation and comparativeexperiments were conducted to validate the ffectivenessof themulti-modal degraded features.Theresultsshow that the multi-modal degraded featurespredicted by MD-CLIP model were embed into theunderwater image enhancement modelbyusing cross-atention mechanism,the image enhancement performanceand generalization performance of the model are significantlyimproved.MD-CLIP model can be added to other image enhancement models as a universal enhancement module.

Keywords:underwaterimage enancement;multi-modaldegradation feature;;contrastivelearning;cross-attentionmechanism

近年来,随着海洋资源开发的兴起,水下图像增强技术备受关注。(剩余15082字)

monitor
客服机器人