基于BEV多模态特征融合的三维目标检测算法

打印
收藏

收藏成功

微博 QQ空间微信

打开文本图片集

关键词：BEV特征；数据预处理；全局注意力融合；KITTI数据集；三维目标检测中图分类号：TP391.4 文献标志码：A 文章编号：1001-3695（2025）12-038-3823-09doi：10.19734/j.issn.1001-3695.2025.03.0101

3D object detection algorithm based on BEV multimodal feature fusion

Li Xun1，Zhang Youbing1，2†，Zhou Kui1，Fu Yongzhi1，Song Jie1 （1.Sring-XJntbolfeltc，ubeo ShiyanHubei 4420oo，China；2.Jingchu Universityof Technology，Jingmen Hubei448000，China）

Abstract：Toaddress the problemof isuffcientdetectionaccuracyforsmalltargetobjectsinmultimodalfusionalgorithms， this paper proposeda3DobjectdetectionalgorithmbasedonBEVmultimodal featurefusion（BEV-MFDet）.Firstly，thealgorithmuniformlynormalizedtheimages toreducethediferenceinfeaturedistributionandappliedradiusfilteringandground fiting tothepointclouddatatoeliminatetheinfluenceofinvalidpointclouds.Atthesametime，thecolumnarnetwork integratedthespatialatentionmodule toenhanceitsabilitytolearpillrfeatures.Secondly，theBEVglobalattentionfusion moduleadaptivelyalocatedmultimodal weightstomore acuraterepresentationofscenefeatures.Experimentalresultsonthe KITTI dataset showthatthe proposedalgorithmachieves thebestoverallperformanceinmedium-diffcultydetection tasks.In particular，the detection accuracy of the small target cyclist class improves by 2.51% and 1. 75% compared to the CenterPoint andLINalgorithms withbetercomprehensive performance，respectively.Theexperimentalresults demonstrate thatthe proposedalgorithmhassuperiorperformance insmalltargetdetectionaccuracyandoveralldetectionperformance，and hashigh application value.

Key Words：BEV features；data preprocessing；global attention fusion；KITTI datasets；3D object detection

0 引言

随着人工智能技术的持续演进，自动驾驶技术已成为智能交通发展的核心方向。（剩余18597字）

试读结束

购买全文6.00元下一篇一种面向手绘草图的多阶段跨域图像检索方法

计算机应用研究

2025年12期

¥12.00/本