ISSN: XXXX-XXXX

Enhancing 3D Object Detection with Multi-Modal Fusion and Spatiotemporal Attention in Autonomous Driving

Abstract

This paper examines how to combine image and point cloud data in optimizing depth reliability, dynamic perception, enhancement of fusion features, robustness against sensor failures, and efficiency in various 3D object detection datasets on autonomous driving. This paper generally critically examines existing approaches, especially the Lift-Splat framework, and designs some new solutions to overcome current limitations. The research work focuses on the improvement of the depth accuracy, dynamic scene perception, and robustness of 3D detection systems by a series of hypotheses. It incorporates advanced fusion techniques, spatiotemporal deformable attention mechanisms, and optimized depth estimation ranges for achieving significant improvements in detection performance. Results from comprehensive experiments validate the effectiveness of these innovations across multiple datasets, thereby positioning the proposed method as a key advancement in the field of autonomous driving perception.

References

  1. Zhang, Y., et al. (2023). "LiDAR-camera fusion for autonomous driving perception: A comprehensive survey." IEEE Transactions on Intelligent Transportation Systems, 24(4), 1218-1232.
  2. Zhou, Z., et al. (2022). "Enhancing depth reliability through multi-modal fusion in autonomous driving." Journal of Field Robotics, 39(2), 235-250.
  3. Liu, X., et al. (2021). "Spatiotemporal attention for dynamic scene perception in autonomous driving." IEEE Transactions on Robotics, 37(8), 2344-2356.
  4. Li, H., et al. (2020). "Optimizing depth range estimation for 3D object detection in autonomous vehicles." Sensors, 20(10), 2882-2898.
  5. Wang, J., et al. (2023). "Robustness of LiDAR and camera fusion systems in adverse conditions." Autonomous Vehicles Journal, 5(1), 53-65.
Download PDF

How to Cite

K K Lavania, (2025-02-21 19:31:07.394). Enhancing 3D Object Detection with Multi-Modal Fusion and Spatiotemporal Attention in Autonomous Driving. Abhi International Journal of Artificial Intelligence Applications in Engineering, Volume zZUTWSDuBR78pc6zbKip, Issue 1.