A 3D object detection method that exploits visibility information from LiDAR point clouds to improve accuracy.
WYSIWYG is a 3D object detection method that leverages visibility information from LiDAR point clouds to improve detection accuracy. It addresses the problem of detecting objects in 3D environments where occlusion and partial visibility are common, such as in autonomous driving scenarios. The approach integrates raycasting to model visibility and feeds this into a deep learning detector for enhanced performance.
Researchers and engineers working on 3D computer vision, particularly in autonomous driving, robotics, or any field requiring accurate object detection from LiDAR data.
Developers choose WYSIWYG for its novel use of visibility cues, which leads to state-of-the-art detection accuracy on benchmarks like NuScenes. It provides an open-source implementation built on the established SECOND framework, making it accessible for experimentation and integration.
WYSIWYG: Exploiting Visibility for 3D Object Detection, CVPR 2020 (Oral)
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Uses raycasting to extract visibility volumes from LiDAR point clouds, improving detection in occluded scenes, as shown in the CVPR 2020 paper.
Integrates visibility information early in the detection pipeline for robust feature learning, leading to state-of-the-art performance on benchmarks.
Built on the established SECOND detector, ensuring access to its optimizations and community support, with pre-trained models available.
Provides pre-trained models and configurations for the NuScenes dataset, enabling quick evaluation and deployment for autonomous driving research.
Requires installing spconv, compiling code with CMake, and extensive setup from SECOND's documentation, making it barrier-heavy for newcomers.
Primarily designed for NuScenes; adapting to other datasets demands significant modification of data preprocessing and model configurations.
The raycasting step adds extra computation, which can impact inference speed and resource usage, potentially hindering real-time applications.