MonoDFNet: Monocular 3D Object Detection with Depth Fusion and Adaptive Optimization.

0 Người đánh giá. Xếp hạng trung bình 0

Tác giả: Ruohai Di, Yuhan Gao, Wei Hong, Liangliang Li, Xiaoyan Li, Mengyu Sun, Peng Wang

Ngôn ngữ: eng

Ký hiệu phân loại: 296.124 $Palestinian Talmud (Jerusalem Talmud, Talmud Yerushalmi)

Thông tin xuất bản: Switzerland : Sensors (Basel, Switzerland) , 2025

Mô tả vật lý:

Bộ sưu tập: NCBI

ID: 79394

Thêm vào giỏ Liên kết toàn văn

Monocular 3D object detection refers to detecting 3D objects using a single camera. This approach offers low sensor costs, high resolution, and rich texture information, making it widely adopted. However, monocular sensors face challenges from environmental factors like occlusion and truncation, leading to reduced detection accuracy. Additionally, the lack of depth information poses significant challenges for predicting 3D positions. To address these issues, this paper presents a monocular 3D object detection method based on improvements to MonoCD, designed to enhance detection accuracy and robustness in complex environments. In order to effectively obtain and integrate depth information, this paper designs a multi-branch depth prediction with weight sharing module. Furthermore, an adaptive focus mechanism is proposed to emphasize target regions while minimizing interference from irrelevant areas. The experimental results demonstrate that MonoDFNet achieves significant improvements over existing methods, with AP3D gains of +4.09% (Easy), +2.78% (Moderate), and +1.63% (Hard), confirming its effectiveness in 3D object detection.

Tạo bộ sưu tập với mã QR