Showing 9 of 9 projects
An open-source framework for building multimodal AI systems that enable large language models to understand and chat about videos and images.
An efficient neural network for semantic segmentation of large-scale 3D point clouds using random sampling.
A PyTorch implementation of self-supervised monocular depth estimation using 3D packing for high-resolution, real-time depth prediction.
A PyTorch implementation of Social GAN for predicting socially acceptable human trajectories using generative adversarial networks.
A single-stage 3D object detector for point clouds that improves localization precision by explicitly leveraging structure information.
A lightweight neural network for near-real-time semantic segmentation of LiDAR point clouds using polar coordinate quantization.
A large-scale driving behavior dataset with LiDAR point clouds, dashboard videos, and sensor data for autonomous driving research.
A TensorFlow implementation of the Mnemonic Descent Method for end-to-end face alignment.
A 3D object detection method that exploits visibility information from LiDAR point clouds to improve accuracy.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.