An open-source framework for building multimodal AI systems that enable large language models to understand and chat about videos and images.
An efficient neural network for semantic segmentation of large-scale 3D point clouds using random sampling.
A PyTorch implementation of self-supervised monocular depth estimation using 3D packing for high-resolution, real-time depth prediction.