Computer Vision

universal-data-toolJavaScript

Forks241

Last commit5 years ago

A web/desktop application for collaborative labeling and annotation of images, text, audio, documents, and other data types.

#dataset-creation#desktop-app#web-app

#c-library#image-analysis#document-analysis

Forks200

Last commit1 year ago

LeptonicaC

A C library for efficient image processing and analysis, widely used in OCR and computer vision applications.

#hash#perceptual-hashes#php-library

Forks433

Last commit12 days ago

Image HashPHP

A PHP library for generating perceptual image hashes to detect similar or duplicate images.

#ai#python-library#facial-landmarks

Forks174

Last commit10 months ago

retinafacePython

A deep learning-based facial detection library for Python with facial landmark extraction.

SfMLearnerJupyter Notebook

Forks197

Last commit1 month ago

An unsupervised learning framework for depth and ego-motion estimation from monocular videos using TensorFlow.

#autonomous-driving#kitti-dataset#deep-learning

Forks555

Semantic Segmentation EditorJavaScript

A web-based labeling tool for creating semantic segmentation training data from 2D images and 3D point clouds.

#labeling-tool#autonomous-driving#image-labeling

Open source Python module for computer visionPython

Forks448

Last commit

A pure Python computer vision library based on the book 'Programming Computer Vision with Python'.

#open-source#python-library#educational

#robotics#autonomous-vehicles#localization

Forks679

Last commit5 years ago

Awesome SLAM datasets

A curated collection of datasets for Simultaneous Localization and Mapping (SLAM) research, categorized by topic, platform, and environment.

#edge-detection#deep-learning#object-boundary

Forks345

Last commit1 year ago

[Web]C++

A deep learning-based edge detection algorithm using holistically-nested fully convolutional neural networks.

#python-library#scipy#numba

Forks535

Last commit2 years ago

pymattingPython

A Python library implementing multiple alpha matting algorithms for extracting foreground objects from images.

Lip Reading - Cross Audio-Visual Recognition using 3D Architectures in TensorFlowPython

Forks225

Last commit3 months ago

:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

#3d-convolutional-network#deep-learning#speech-recognition

A computer vision library for human-computer interaction, focusing on head pose estimation, gaze direction, skin detection, motion tracking, and saliency mapping using CNNs.

#saliency-mapping#histogram-comparison#skin-detection

An end-to-end deep learning system for reconstructing complete 3D scenes (geometry and semantics) from posed 2D images.

#deep-learning#scene-understanding#3d-reconstruction

Forks216

#hacktoberfest#thumbnail-generation#go-library

smartcropGo

A pure Go implementation that finds optimal image crops for arbitrary aspect ratios using content-aware analysis.

Forks117

#robotics#autonomous-driving#evaluation-metrics

AB3DMOTPython

A real-time baseline 3D multi-object tracking system using LiDAR point clouds, combining 3D Kalman filter and Hungarian algorithm.

#image-mask#cutout#objective-c

Forks416

Last commit2 years ago

TinyCrayonSwift

A smart and easy-to-use image masking and cutout SDK for iOS and iPadOS mobile applications.

Forks154

#lidar#robotics#point-clouds

GitHub repositoryC++

A modular C++ library implementing the Iterative Closest Point (ICP) algorithm for aligning 2D and 3D point clouds in robotics and computer vision.

#image-filters#graphics#go-library

Forks566

Last commit8 months ago

giftGo

A pure Go library providing a comprehensive set of image processing filters with no external dependencies.

#robotics#3d-environment#sim2real

Forks122

Last commit2 years ago

AI2-THORC#

An open-source, near photo-realistic 3D simulation platform for training and evaluating embodied AI agents.

#robotics#point-clouds#sun-rgb-d

Forks296

Last commit8 months ago

VotenetPython

An end-to-end 3D object detection network that uses deep point set networks and Hough voting to directly detect objects in point clouds.

Forks389

lidar_camera_calibrationC++

A ROS package for extrinsic calibration between LiDAR and camera sensors using 3D-3D point correspondences.

#lidar#robotics#sensor-fusion

#lidar#pointcloud#autonomous-robots

Forks474

Last commit9 months ago

loam_velodyneC++

A realtime LiDAR odometry and mapping (LOAM) method for state estimation and mapping using 3D lidar sensors like Velodyne VLP16.

react-native-arkitObjective-C

Forks959

Last commit7 years ago

React Native binding for iOS ARKit, enabling augmented reality app development with 3D components and plane detection.

#ios#arkit#objective-c

Forks139

#iot#camera#traffic-analysis

opendatacamJavaScript

An open-source computer vision tool that detects, tracks, and counts moving objects from cameras and videos.

#opencv#image-processing#tensorflow

Forks300

Last commit3 months ago

YOLO TensorFlowPython

TensorFlow implementation of YOLO for real-time object detection using pretrained YOLO_small, YOLO_tiny, and YOLO_face models.

Awesome-Interaction-aware-Trajectory-PredictionTeX

Forks638

Last commit7 years ago

A curated checklist of state-of-the-art research materials (datasets, papers, code) for interaction-aware trajectory prediction.

#robotics#autonomous-driving#research-datasets

#robotics#autonomous-driving#point-clouds

Forks308

Last commit

GitHub repositoryPython

A deep learning pipeline for 3D object detection from RGB-D data by combining 2D detectors with PointNet-based 3D processing.

A collection of high-performance GICP-based point cloud registration algorithms with multi-threaded and GPU-accelerated implementations.

#robotics#gicp#icp

#c-library#superpixels#matlab-toolbox

Forks371

Last commit1 year ago

VLFeatC

An open-source C library with MATLAB interfaces implementing popular computer vision algorithms for image understanding and local feature extraction.

Forks620

#geometry-processing#viewer#mesh

Easy3DC++

A lightweight C++/Python library for 3D data processing, geometry algorithms, and rendering with an easy-to-use API.

#robotics#open-source#3d-reconstruction

Forks275

Last commit3 months ago

ORB-SLAMC++

A real-time monocular SLAM system for computing camera trajectories and sparse 3D scene reconstruction.

Forks818

#model-training#deep-learning#fine-tuning

LightlyTrainPython

An all-in-one framework for training state-of-the-art computer vision models, covering pretraining, fine-tuning, and distillation.