Materials from a CVPR 2015 workshop on deep learning and computer vision with the Torch framework.
A TensorFlow implementation of unsupervised cross-domain image generation for transferring images between domains, such as SVHN to MNIST.
A PyTorch implementation of neural style transfer, combining the content of one image with the artistic style of another.
A multi-threaded, SSE-optimized Normal Distributions Transform algorithm for point cloud registration, offering up to 10x speedup over the original PCL implementation.
A curated collection of papers, code, and datasets for deep learning and multimodal learning in video analysis.
A Flutter plugin for integrating Apple's ARKit framework to build augmented reality experiences on iOS.
A Go library for perceptual image hashing, supporting average, difference, and perception hashing algorithms.
A visible-infrared paired dataset for low-light vision tasks like pedestrian detection, image fusion, and image-to-image translation.
A curated list of research papers and resources for scene understanding in computer vision, covering 3D reconstruction, layout estimation, and primitive detection.
A CNN-based captcha solver for the Taiwan Railway booking website, with a training-set generator that mimics the captcha style and applies data augmentation.
A tool that converts KITTI autonomous driving datasets into ROS bag files for easy playback and integration.
A semi-automatic, web-based toolbox for annotating 3D bounding boxes in full-surround, multi-modal sensor data streams.
A Ruby wrapper for OpenCV, enabling computer vision and image processing in Ruby applications.
A C++ library for fast ground segmentation from LiDAR point clouds using the line-fit algorithm.
A lightweight, accurate, and robust monocular visual-inertial odometry system based on a hybrid Multi-State Constraint Kalman Filter.
An open-source image analysis software package for plant phenotyping using computer vision.
A TensorFlow implementation that generates semantically segmented bird's-eye-view images from multiple vehicle-mounted cameras using a Sim2Real deep learning approach.
Public domain Java software for processing and analyzing scientific images across multiple platforms.
A benchmark and toolkit for discovering, detecting, recognizing, and tracking UAVs in the wild using RGB and thermal infrared video.
A desktop tool for labeling individual points and polygons in LiDAR point cloud datasets, designed specifically for the KITTI format.
Detects 6-DOF grasp poses for parallel jaw grippers in 3D point clouds, enabling robotic grasping of novel objects in clutter.
Real-time 3D semantic reconstruction library for robotics, building dense metric-semantic maps from 2D sensor data.
A C++14 header-only library for high-performance video and image processing using meta-programming and SIMD optimizations.
A collection of CAPTCHA-breaking implementations using OpenCV, Tesseract OCR, and machine learning algorithms.
An open-source simulator for event cameras, providing accurate event generation with IMU and multi-camera support.
Real-time 3D semantic mapping system using a handheld RGB-D camera, built on ROS with ORB_SLAM2 and PSPNet.
A Ruby wrapper around the pHash library for detecting duplicate and near-duplicate images using perceptual hashing.
An open-source machine learning system for training autonomous RC cars using computer vision and neural networks.
A curated reading list of papers, datasets, and simulators for embodied vision research, covering navigation, interaction, and reasoning.
A collection of CUDA-accelerated libraries for point cloud processing, providing GPU-optimized alternatives to PCL functions.
Real-time object detection on Android using YOLO with TensorFlow, detecting 20 object classes from the Pascal VOC dataset.
Interactive segmentation and tracking tools for microscopy images built on Segment Anything.
Official PyTorch implementation of joint monocular 3D vehicle detection and tracking (ICCV 2019).
A learning-based approach for moving object segmentation in 3D LiDAR data, distinguishing moving from static objects in real time.
A tiny JavaScript library for applying image processing filters directly in the browser.
A ROS package for calibrating camera and LiDAR sensors using OpenCV's PnP and Levenberg-Marquardt optimization.