Official repository for Big Transfer (BiT) models, providing pre-trained visual representations for efficient transfer learning across computer vision tasks.
A curated list of resources for Document Understanding (DU), covering research, datasets, tools, and applications in Intelligent Document Processing.
A foundational PyTorch library for training deep learning models, serving as the core engine for the OpenMMLab ecosystem.
A desktop application for semi-automatic image annotation using OpenCV's watershed algorithm with manual brush refinement.
A target-less, automatic toolbox for LiDAR-camera extrinsic calibration that works with various sensor models without requiring calibration targets.
A deep learning framework for feature learning directly from point clouds using X-Conv operations, achieving state-of-the-art results in classification and segmentation.
A MATLAB toolbox implementing Convolutional Neural Networks (CNNs) for computer vision applications.
An open-source differentiable dense SLAM library for PyTorch, enabling gradient flow from map outputs to sensor inputs.
A pure PHP library for detecting and decoding QR codes without external extensions.
A fast, modular PyTorch reference implementation for training and evaluating semantic segmentation models.
A curated list of awesome libraries for generating CAPTCHAs and tools for cracking them.
A curated collection of TensorFlow Lite models, sample apps, tools, and learning resources for mobile and edge AI development.
An open-source, N-dimensional image processing platform for scientific imaging with a modular, headless architecture.
A curated collection of papers, datasets, and resources for 2D/3D human pose estimation, mesh representation, and related computer vision tasks.
A deep learning technique for finding semantically-meaningful dense correspondences between images to enable visual attribute transfer.
A technique using Fourier feature mappings to enable neural networks to learn high-frequency functions in low-dimensional domains.
A curated collection of open-source computer vision pre-trained models across TensorFlow, Keras, PyTorch, Caffe, and MXNet frameworks.
Open-source flight software, simulator, and tools for NASA's Astrobee free-flying robots on the International Space Station.
Train neural networks with OpenStreetMap data and satellite imagery to classify roads and map features.
A Blender addon for importing photogrammetry and NeRF data from various reconstruction libraries and point cloud formats.
A PHP library that extracts dominant and representative colors from images using human-like perception.
A fast and robust algorithm for segmenting Velodyne LiDAR point clouds into objects for autonomous driving applications.
Scene text detection using the Connectionist Text Proposal Network (CTPN) to locate text lines in natural images.
A curated list of awesome LIDAR sensors, datasets, libraries, algorithms, and simulators for robotics and autonomous driving.
A curated list of awesome LIDAR sensors, datasets, libraries, algorithms, frameworks, and simulators for robotics and autonomous driving.
A PyTorch implementation of self-supervised monocular depth estimation using 3D packing for high-resolution, real-time depth prediction.
A library for building high-performance custom human pose estimation applications with real-time inference and flexible model development.
A curated repository of famous Vision-Language Models (VLMs) detailing their architectures, training procedures, and datasets.
A curated collection of papers, toolboxes, and notes for LiDAR-camera extrinsic calibration methods.
A curated list of resources for random forest and other tree-based machine learning methods.
Python tools for working with the KITTI autonomous driving dataset, providing data loaders and utilities for computer vision and robotics.
A curated collection of resources, papers, and frameworks for image-to-image translation research and applications.
A collection of Swift code examples demonstrating Depth APIs on iOS devices with dual or TrueDepth cameras.
A curated list of delightful npm packages that showcase surprising and innovative JavaScript capabilities.
A Python toolbox for quantitative evaluation of visual(-inertial) odometry trajectories using alignment methods and error metrics.
A deep learning system for detecting known objects and estimating their 6-DoF pose from RGB images.