Showing 36 of 599 projects
A C++ implementation of the Go-ICP algorithm for globally optimal 3D point cloud registration with outlier trimming.
Winning solution for the Galaxy Challenge on Kaggle, using convolutional neural networks to classify galaxy morphologies.
A single-stage 3D object detector for point clouds that improves localization precision by explicitly leveraging structure information.
A deep learning framework for detecting and localizing upper-body, lower-body, and full-body clothes in fashion images.
A TensorFlow implementation of the neural style transfer algorithm that applies artistic styles to images.
A Caffe implementation of MTCNN for joint face detection and alignment using a multi-task cascaded convolutional neural network.
Classify images offline on iOS using Watson Visual Recognition trained models and Apple's Core ML framework.
A Torch implementation of a VIS+LSTM model for answering questions about images using deep learning.
A Swift library that uses iOS 11 Vision API to automatically detect and crop faces from images.
A PyTorch implementation of TResNet, a high-performance convolutional neural network architecture optimized for GPU training and inference.
A ROS-based object detection and pose estimation library for 2D and 3D applications using OpenCV.
A deep learning library for single-cell analysis of biological images, specializing in cell segmentation and tracking.
An open-source ROS-based platform with practical exercises for learning robotics, AI, and computer vision.
An open-source, low-cost, camera-based weed detection device for precision spot spraying in agriculture.
A Python implementation for fully automatic extrinsic calibration of 3D LiDAR and cameras using laser reflectance intensity.
A real-time, uncertainty-aware deep learning model for semantic segmentation of 3D LiDAR point clouds in autonomous driving.
A 3D vision library for monocular and stereo 3D human detection, social distancing, and body orientation estimation from 2D keypoints.
A GPU-accelerated C++ library for visual-inertial odometry frontend tasks, optimized for high-speed robotics.
A Flutter plugin for barcode scanning, text recognition (OCR), and face detection using Google Mobile Vision APIs.
A ROS wrapper for the AprilTag 3 visual fiducial detector, enabling marker-based pose estimation in robotics applications.
A neural network for object detection using multi-level fusion of camera and radar data, built on Keras RetinaNet.
Utility scripts for loading, visualizing, and inspecting the KITTI-360 autonomous driving dataset.
ROS & ROS2 implementation of Patchwork++, a fast and robust ground segmentation method for 3D LiDAR point clouds.
An AI-powered captcha solver using SimGAN to generate synthetic training data without manual labeling.
An image processing library built on JAX, designed to be optimized and parallelized with JAX transformations.
A generic C++ library for image analysis and computer vision using template-based generic programming.
A comprehensive Jupyter notebook tutorial covering computer vision and machine learning basics using OpenCV and Keras in Python.
A minimalist GPU-only framework for N-dimensional convolutional neural networks focused on speed and hackability.
Lua bindings to ImageMagick and GraphicsMagick for LuaJIT using FFI, enabling image manipulation from Lua scripts.
A neural network for real-time 6D object pose tracking in video using RGB-D data, trained only on synthetic images.
A Go library for detecting nudity in images, ported from nude.js.
A lightweight neural network for near-real-time semantic segmentation of LiDAR point clouds using polar coordinate quantization.
A curated list of deep learning research papers and implementations for high dynamic range image and video synthesis.
A MATLAB/Octave toolbox for processing High Dynamic Range (HDR) images, including tone mapping and expansion operators.
A benchmark dataset and toolkit for RF-based drone detection and identification using raw IQ data and deep learning models.
A cross-platform image viewer for inspecting and rendering raw pixel data from Luminance, YUV, RGB, ARGB, and Bayer formats.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.