Showing 36 of 599 projects
A graphical application for rapidly prototyping and deploying computer vision algorithms, primarily for robotics.
A vision transformer-based deep learning model for automated instance segmentation and classification of cell nuclei in histopathology images.
A neural network playground built in pure Swift for iOS, with no third-party dependencies.
A Python toolbox for image segmentation featuring superpixel segmentation, object center detection, and region growing with shape priors.
A neural networks toolbox for medical image analysis, providing specialized layers, models, and utilities for TensorFlow/Keras.
A re-implementation of the SPP-net object detection algorithm using spatial pyramid pooling in deep convolutional networks for visual recognition.
A real-time RGB-based pipeline for object detection and 6D pose estimation using a denoising autoencoder trained on simulated 3D views.
A curated list of deep learning resources for 3D shape processing, covering classification, reconstruction, and generation.
TensorFlow implementation of weakly-supervised object localization using only image-level labels, without bounding box annotations.
A command-line utility that uses convolutional neural networks to search and filter videos based on objects and places that appear in them.
A C++/TensorRT inference module for RangeNet++, enabling fast LiDAR semantic segmentation for robotics applications.
A fast, cross-platform image decoding library with clean APIs for C, C++, and Python, supporting 20+ formats.
A simple, multi-language implementation of the Iterative Closest Point algorithm for 3D point cloud registration.
A full LiDAR SLAM system for static environment mapping using LiDAR with optional GPS, IMU, and odometry support.
An optimized, lightweight CUDA-based 3D reconstruction implementation derived from the original Kinfu algorithm.
A minimal implementation of Deep Convolutional Generative Adversarial Networks (DCGAN) using TensorLayerX for generating realistic images.
A ROS 2 node for detecting AprilTag fiducial markers in camera images and publishing their poses and metadata.
A Rust library for calculating perceptual hash values of images using multiple algorithms.
A Python toolbox for content-aware restoration of fluorescence microscopy images using deep learning.
A JAX library implementing Lie groups for rigid body transformations in computer vision and robotics.
A Docker-based image annotation tool for bounding box labeling with auto-labeling support, designed for deep learning training.
ROS 2 wrapper for Stereolabs ZED cameras, providing access to depth, point clouds, object detection, and spatial mapping.
A Swift library for detecting and cropping faces, barcodes, and text in images using iOS 11 Vision API.
A TensorFlow-based object detection model that localizes and identifies multiple objects in images using SSD MobileNet V1 or Faster R-CNN ResNet101.
A curated list of top-tier publications and resources for LiDAR-Visual fusion SLAM systems.
A convolutional neural network for CAPTCHA recognition using Keras and PyTorch.
A ROS stack providing perception packages for 2D image and 3D point cloud processing in robotics.
A deep learning-based solution for automatically recognizing and solving 12306 railway website captchas.
A PyTorch Vision-like library for downloading, manipulating, and loading event-based/spike-based neuromorphic datasets.
An integrated framework for training custom generative AI image-to-image models using GANs, Diffusion, and Consistency Models.
A real-time fiducial tag system for LiDAR point clouds, robust to lighting and compatible with visual markers like AprilTags.
A centralized Python framework for agricultural machine learning, providing access to public datasets, benchmarks, pretrained models, and synthetic data generation.
A curated list of academic papers, datasets, and code for correcting rolling shutter effects, radial distortions, and text distortions in images and videos.
A curated list of resources for makeup and hairstyle transfer research using computer vision and generative AI.
A ROS-based system for robot localization and mapping using ceiling or floor-mounted fiducial markers.
A Python devkit for loading, exploring, and manipulating the PandaSet, a large-scale autonomous driving dataset with LiDAR, camera, and annotations.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.