Showing 36 of 599 projects
An open-source tool to detect and blur faces and license plates in images for privacy compliance, using TensorFlow object detection.
A large-scale image dataset for self-supervised pretraining without humans, designed to reduce privacy concerns.
A PyTorch framework for deep learning on point clouds, providing a modular and reproducible foundation for 3D vision tasks.
A curated collection of resources on adversarial examples in deep learning, covering attacks, defenses, and applications.
A collection of pretrained deep learning models (StyleGAN2, GPT2, VGG, ResNet) for the Jax/Flax ecosystem.
Converts KITTI autonomous driving dataset raw data to ROS bags and provides a C++ library for direct data access.
An iOS library that uses face detection to calculate device distance and angle relative to a user's face for interactive 3D effects.
FLAME dataset and deep learning models for fire detection in aerial imagery using UAVs, supporting classification and segmentation tasks.
ROS package for sensor processing, object detection, tracking, and evaluation using the KITTI Vision Benchmark dataset.
A long-term autonomous driving dataset from Europe with multi-sensor data (GPS-RTK, LiDAR, cameras, IMU) for localization and mapping research.
A curated list of open-source software tools for medical imaging research, including segmentation, visualization, and deep learning libraries.
A Python library that simplifies using, finetuning, and deploying state-of-the-art machine learning models for various AI tasks.
A real-time object-level reconstruction system for 6D pose estimation using volumetric fusion and multi-object reasoning.
A Python library for interacting with FLIR thermal imaging cameras, capturing raw images, and converting proprietary file formats.
A ROS2 intelligent visual grasp solution for industrial robots, integrating OpenVINO grasp detection with MoveIt motion planning.
An iOS library that applies artistic styles to images using Core ML and pre-trained neural style transfer models.
A Python-based CAPTCHA breaking solution using Keras and OpenCV, developed for a data science competition.
A large-scale driving behavior dataset with LiDAR point clouds, dashboard videos, and sensor data for autonomous driving research.
ROS 2 packages for visual servoing and tracking using the ViSP library.
A deep learning model that reads IRCTC captchas with 98% accuracy, demonstrating their vulnerability to automated booking.
A curated collection of LiDAR place recognition methods, datasets, and algorithms for robotics and autonomous systems.
A TensorFlow CNN implementation for Chinese character recognition, achieving 92.5% top-1 accuracy with batch normalization.
A benchmark dataset and meta self-learning method for multi-source domain adaptation in scene text recognition.
A vision transformer architecture that aggregates nested local transformers on image blocks for better accuracy, data efficiency, and convergence.
A single-header, zero-allocation C library for applying fast, chainable image filters compatible with SVG and CSS semantics.
A CUDA-based implementation of KinectFusion for real-time dense surface reconstruction and tracking using a Kinect camera.
A C++14 header-only library providing generic image representations and algorithms with performance close to hand-written code.
A foundation model for cell segmentation that achieves state-of-the-art performance across diverse cellular targets and imaging modalities.
A simulation-based deep learning approach to enhance the resolution of 3D lidar point clouds for ground vehicles.
A tool for calibrating event cameras by converting event data to images and using standard image-based calibration toolboxes.
A PyTorch-based segmentation toolbox for electron microscopy connectomics, enabling neural structure analysis in 3D volumes.
Automated UI testing framework for set-top boxes and smart TVs using infrared commands and video analysis.
A Torch7 package providing extended neural network modules, criterions, and utilities for deep learning research.
A Docker container for face detection using Faster R-CNN deep learning, processing videos and images with bounding box outputs.
A simple, flexible, and extensible object-oriented template for PyTorch projects.
Automatically classifies and labels urban point clouds using data fusion with public datasets and region growing techniques.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.