An end-to-end 3D object detection network that uses deep point set networks and Hough voting to directly detect objects in point clouds.
VoteNet is a deep learning model for 3D object detection that operates directly on point cloud data. It solves the problem of accurately localizing objects in sparse 3D scans by using a Hough voting mechanism where points vote for probable object centers, enabling precise bounding box prediction without relying on 2D images or voxel grids.
Researchers and engineers working in 3D computer vision, autonomous systems, robotics, and augmented reality who need to detect objects in point cloud data from sensors like LiDAR or depth cameras.
Developers choose VoteNet because it provides state-of-the-art accuracy on real-world datasets with a simple, efficient architecture that processes raw point clouds directly, eliminating the need for hand-crafted features or intermediate representations like voxels.
Deep Hough Voting for 3D Object Detection in Point Clouds
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
VoteNet operates directly on raw point clouds without converting to voxels or 2D images, preserving 3D structure and simplifying the detection pipeline, as emphasized in the README's introduction.
Achieves high mAP scores on real-world datasets, with 57 mAP@0.25 and 32 mAP@0.5 on SUN RGB-D, outperforming previous methods using purely geometric information, as documented in the evaluation results.
Features a simple architecture with compact model size and high computational efficiency, making it suitable for deployment in resource-constrained scenarios, as highlighted in the key features.
The network is fully trainable from input point clouds to output bounding boxes, enabling seamless integration into deep learning workflows without manual feature engineering, as stated in the README.
Installation requires compiling CUDA layers for PointNet++, specific versions of Pytorch and TensorFlow, and Matlab for SUN RGB-D data preparation, creating a significant barrier to entry.
Out-of-the-box, it only supports ScanNet and SUN RGB-D; adapting to custom data requires creating a new dataset class and following tips, adding development overhead and complexity.
Tested with specific software versions (e.g., Ubuntu 18.04, CUDA 10.0), and the README notes bug fixes like gradient computation updates, indicating maintenance challenges that could affect reproducibility.