Computer Vision

#monocular-vision#robotics#cuboid-detection

CubeSLAM and ORB SLAMC++

Monocular 3D object detection and SLAM system that detects and tracks cuboids to estimate camera and object poses.

Stars885

Forks231

Last commit5 years ago

Applied Deep Learning for Computer Vision with TorchJupyter Notebook

CVPR 2015 workshop materials for learning deep learning and computer vision with Torch framework.

#educational-workshop#deep-learning#neural-networks

Stars869

Forks412

Last commit

Domain Transfer NetworkPython

TensorFlow implementation of unsupervised cross-domain image generation for transferring images between domains like SVHN to MNIST.

#deep-learning#svhn#generative-models

Stars861

Forks199

Last commit8 years ago

neural-style-ptPython

A PyTorch implementation of neural style transfer, combining the content of one image with the artistic style of another.

#generative-art#styletransfer#deep-style

Stars857

Forks168

ndt_ompC++

A multi-threaded, SSE-optimized Normal Distributions Transform algorithm for point cloud registration, offering up to 10x speedup over the original PCL implementation.

#robotics#gicp#matching

Stars850

Forks380

Awesome Deep Learning for Video Analysis

A curated collection of papers, code, and datasets for deep learning and multimodal learning in video analysis.

#video-retrieval#research-papers#deep-learning

Stars846

Forks173

Last commit4 years ago

ARKit PluginDart

A Flutter plugin for integrating Apple's ARKit framework to build augmented reality experiences on iOS.

#dart#ios#arkit

Stars842

Forks241

Last commit1 month ago

goimagehashGo

A Go library for perceptual image hashing, supporting average, difference, and perception hashing algorithms.

#ahash#hacktoberfest#hash

Stars838

Forks79

#low-light-vision#image-fusion#image-translation

LLVIPJupyter Notebook

A visible-infrared paired dataset for low-light vision tasks like pedestrian detection, image fusion, and image-to-image translation.

Stars838

Forks75

Last commit11 months ago

Awesome Scene Understanding

A curated list of research papers and resources for scene understanding in computer vision, covering 3D reconstruction, layout estimation, and primitive detection.

#layout-estimation#geometric-reasoning#research-papers

Stars820

Forks97

Last commit11 months ago

JasonLiTW/simple-railway-captcha-solver#english-versionPython

A CNN-based captcha solver for Taiwan Railway booking website with a training set generator that mimics captcha style and uses data augmentation.

#captcha-generator#cnn-keras#synthetic-data

Stars816

Forks168

Last commit

3D Bounding Box Annotation ToolTypeScript

A semi-automatic, web-based toolbox for annotating 3D bounding boxes in full-surround, multi-modal sensor data streams.

#autonomous-driving#bounding-box#pointcloud

Stars814

Forks164

#robotics#autonomous-driving#python-tool

kitti2bagPython

Convert KITTI autonomous driving datasets into ROS bag files for easy playback and integration.

Stars812

Forks266

#image-analysis#ruby-gem#haar-cascades

ruby-opencvC++

A Ruby wrapper for OpenCV, enabling computer vision and image processing in Ruby applications.

Stars811

Forks126

Last commit5 years ago

Anti-UAVPython

A benchmark and toolkit for discovering, detecting, recognizing, and tracking UAVs in the wild using RGB and thermal infrared video.

#thermal-infrared#uav-detection#object-tracking

Stars807

Forks131

#lidar#robotics#sensor-fusion

GitHub repositoryC++

A C++ library for fast ground segmentation from LiDAR point clouds using the line-fit algorithm.

Stars806

Forks154

#image-analysis#science#agricultural-technology

PlantCVPython

An open-source image analysis software package for plant phenotyping using computer vision.

A lightweight, accurate, and robust monocular visual-inertial odometry system based on a hybrid Multi-State Constraint Kalman Filter.

#robotics#sensor-fusion#ros-node

Stars803

Forks162

#autonomous-driving#simulation#inverse-perspective-mapping

Cam2BEVPython

A TensorFlow implementation for generating semantically segmented bird's eye view images from multiple vehicle-mounted cameras using a Sim2Real deep learning approach.

Public domain Java software for processing and analyzing scientific images across multiple platforms.

#microscopy#open-science#research-tools

Stars774

Forks261

Last commit2 days ago

gpdC++

Detects 6-DOF grasp poses for parallel jaw grippers in 3D point clouds, enabling robotic grasping of novel objects in clutter.

#robotics#grasp-detection#grasping

Stars764

Forks249

Last commit4 years ago

point_labelerC++

A desktop tool for labeling individual points and polygons in LiDAR point cloud datasets, specifically designed for KITTI format.

#desktop-application#point-clouds#qt

Stars749

Forks167

#robotics#sensor-fusion#rviz

Kimera-SemanticsC++

Real-time 3D semantic reconstruction library for robotics, building dense metric-semantic maps from 2D sensor data.

Stars744

Forks150

#parallel-computing#high-performance#simd

Video++C++

A C++14 header-only library for high-performance video and image processing using meta-programming and SIMD optimizations.

Stars740

Forks113

Last commit7 years ago

captcha-breakC++

A collection of CAPTCHA-breaking implementations using OpenCV, Tesseract OCR, and machine learning algorithms.

#opencv#tesseract-ocr#captcha-solving

Stars730

Forks215

Last commit7 years ago

ESIMC

An open-source simulator for event cameras, providing accurate event generation with IMU and multi-camera support.

#robotics#simulation#opengl

Stars723

Forks136

#robotics#3d-mapping#rgb-d

semantic_slamC++

Real-time 3D semantic mapping system using a handheld RGB-D camera, built on ROS with ORB_SLAM2 and PSPNet.

Stars718

Forks175

Last commit7 years ago

phashionRuby

A Ruby wrapper around the pHash library for detecting duplicate and near-duplicate images using perceptual hashing.

#duplicate-multimedia-files#image-analysis#ruby-gem

Stars711

Forks129

Last commit9 months ago

VoxelHashing: Large-scale KinectFusionC++

[Siggraph Asia 2013] Large-Scale, Real-Time 3D Reconstruction

#3d-reconstruction#kinect#computer-vision

Stars711

Forks202

Last commit5 years ago

SuironPython

An open-source machine learning system for training autonomous RC cars using computer vision and neural networks.

#robotics#neural-networks#rc-cars

Stars710

Forks77

Last commit9 years ago

Awesome Embodied Vision

A curated reading list of papers, datasets, and simulators for embodied vision research, covering navigation, interaction, and reasoning.

#robotics#simulation#research-papers

Stars705

Forks78

#bioimage-analysis#microscopy#segment-anything

MicroSAMJupyter Notebook

Interactive segmentation and tracking tools for microscopy images built on Segment Anything.

A CUDA-accelerated library collection for point cloud processing, providing GPU-optimized alternatives to PCL functions.

#robotics#cuda#pcl-alternative

Stars699

Forks105

#demo#deep-learning#android

android-yoloC++

Real-time object detection on Android using YOLO with TensorFlow, detecting 20 object classes from the Pascal VOC dataset.

Stars691

Forks212

#lidar#robotics#lidar-slam

GitHub repositoryPython

A learning-based approach for moving object segmentation in 3D LiDAR data, distinguishing moving vs. static objects in real-time.

Stars686

Forks111