Computer Vision

#canvas#photo-filters#javascript-library

lena.jsJavaScript

A tiny JavaScript library for applying image processing filters directly in the browser.

Stars679

Forks84

Last commit3 years ago

Illustration2VecPython

A deep learning library for tag estimation and semantic feature vector extraction from illustrations.

#chainer#python-library#illustration-analysis

Stars677

Forks113

DVO: dense visual odometryC++

A dense visual odometry and SLAM system for RGB-D cameras that estimates camera motion from consecutive depth images.

#robotics#pose-estimation#rgbd-cameras

Stars667

Forks302

Last commit9 years ago

LucentPython

A PyTorch adaptation of Lucid for visualizing and interpreting neural networks through feature visualization.

#neural-network-visualization#deep-learning#research-tools

Stars663

Forks95

#robotics#perception#opencv

vision_opencvC++

Bridge between ROS 2 and OpenCV for real-time computer vision applications.

Stars661

Forks646

Last commit6 months ago

vision_opencvC++

Bridge between ROS 2 and OpenCV for real-time computer vision applications.

#robotics#perception#opencv

Stars661

Forks646

Last commit6 months ago

Awesome-Torch (Repository on GitHub)

A curated list of awesome Torch tutorials, projects, libraries, and communities for deep learning.

#deep-learning#neural-networks#research-tools

Stars653

Forks138

Last commit8 years ago

cloverC++

ROS-based framework and Raspberry Pi image for controlling PX4-powered drones, enabling easy autonomous flight development.

#robotics#px4#autonomous-drones

Stars651

Forks316

Last commit3 months ago

ZeroCostDL4MicJupyter Notebook

A free Google Colab-based toolbox with Jupyter notebooks and GUI for applying deep learning to microscopy data without coding expertise.

#google-colab#scientific-computing#image-analysis

Stars648

Forks143

Last commit

Awesome Video Text Retrieval

A curated list of deep learning resources for video-text retrieval, including papers, implementations, and datasets.

#video-retrieval#cross-modal-retrieval#research-papers

Stars644

Forks69

Last commit2 years ago

GitHub repositoryC++

A LiDAR-based tool for constructing static maps by removing dynamic points from point cloud sequences.

#lidar#robotics#static-mapping

Stars643

Forks114

Last commit3 months ago

pptkC++

A Python package for visualizing and processing 2D/3D point clouds with interactive rendering and parallelized queries.

#gps-data#lidar#scientific-visualization

Stars634

Forks112

livox_camera_lidar_calibrationC++

A ROS-based tool for manually calibrating extrinsic parameters between Livox LiDAR sensors and cameras using board corners.

#robotics#ceres-solver#pcl

Stars634

Forks151

Last commit4 years ago

Detecto - Train and run object detection models with 5-10 lines of codePython

Build fully-functioning computer vision and object detection models with PyTorch in just 5 lines of code.

#transfer-learning#model-training#python-library

A whole-slide foundation model for digital pathology, pre-trained on real-world data to analyze tissue slides at tile and slide levels.

#research-tool#medical-ai#tile-encoder

Stars626

Forks107

Last commit4 days ago

dataset-apiJupyter Notebook

A toolkit and dataset for autonomous driving research, including trajectory prediction, 3D LiDAR detection, scene parsing, and video inpainting.

#lidar#autonomous-driving#research-toolkit

Stars619

Forks141

Last commit3 months ago

OverFeatC

A convolutional network-based image classifier and feature extractor trained on ImageNet, providing dense feature extraction capabilities.

#research-tool#deep-learning#imagenet

Stars602

Forks198

Last commit12 years ago

GAN-CLSPython

TensorFlow implementation of GAN-CLS algorithm for generating images from text descriptions using adversarial networks.

#text-to-image#tensorlayer#deep-learning

Stars599

Forks161

Food-Recipe-CNNJupyter Notebook

A deep learning system that classifies food images into 230 categories and retrieves matching recipes using convolutional neural networks.

#inceptionv3#transfer-learning#food-recognition

A curated collection of open-source machine learning models compatible with Apple's Core ML framework.

#ai#coremltools#ios

Stars587

Forks63

Last commit6 years ago

VLogPython

A video-language understanding framework that treats video narration as vocabulary and videos as long documents for efficient analysis.

#cvpr-2025#video-understanding#vocabulary

Stars587

Forks31

#autonomous-driving#kitti-dataset#deep-learning

SqueezeSegPython

A convolutional neural network model for real-time road-object segmentation from 3D LiDAR point clouds.

Stars574

Forks240

#lidar#robotics#camera-fusion

CamVoxC++

A low-cost and accurate SLAM system that fuses Livox lidar with camera data for robust localization and mapping.

Stars566

Forks117

Last commit4 years ago

unity-sdkC#

A Unity SDK for integrating IBM Watson AI services like speech, language, and vision into games and applications.

#unity3d#hacktoberfest#csharp

Stars565

Forks205

#lidar#monocular-vision#autonomous-driving

DDADPython

A benchmark dataset for long-range (up to 250m) dense depth estimation in autonomous driving, featuring 360° LiDAR ground truth.

Stars558

Forks54

cnn_handwritten_chinese_recognitionPython

A web application that uses a CNN model to recognize handwritten Chinese characters from an online drawing canvas.

#flask#deep-learning#python

A comprehensive image processing library for Julia, providing tools for loading, manipulating, and analyzing images.

#scientific-computing#image-analysis#julia

Stars551

Forks141

Last commit2 months ago

what the thing?JavaScript

Android app that uses your camera to identify objects and translate their names into different languages.

#camera#language-translation#android

Stars544

Forks73

Awesome Computer Vision Models

A curated list of popular deep learning models for image classification, segmentation, and detection with key performance metrics.

#machine-learning-algorithms#deep-learning#neural-networks

Stars542

Forks93

#fmx-components#free-pascal#opencv-bindings

Delphi-OpenCVPascal

Delphi and Free Pascal bindings for OpenCV 2.4.13, enabling computer vision development in Object Pascal.

Stars538

Forks232

Last commit1 month ago

ARKit EmperorSwift

A collection of practical ARKit 2.0 sample projects for iOS developers, featuring drawing, 3D modeling, physics, and face detection.

#ios12#ios#arkit

Stars536

Forks58

#tensorlayer#deep-learning#brain-tumor

U-NetPython

A U-Net implementation for brain tumor segmentation using the BRATS 2017 dataset with data augmentation and dice loss.

Stars536

Forks179

#cityscape-dataset#deep-learning#real-time-inference

LEDNetPython

A lightweight encoder-decoder neural network for real-time semantic segmentation on resource-constrained devices.

Stars522

Forks79