Computer Vision

711 projects

Showing 36 of 711 projects

A curated list of papers, datasets, and code for 3D point cloud analysis research, covering classification, segmentation, detection, and more.

#autonomous-driving#point-clouds#point-cloud-classification

Stars4.2k

Forks931

Last commit3 years ago

bildGo

A collection of parallel image processing algorithms implemented in pure Go.

#resize#algorithm#graphics

Stars4.2k

Forks215

Last commit15 days ago

Nvidia DIGITS - a web app based on CaffeHTML

A web application for training deep learning models with a focus on computer vision tasks.

#model-training#deep-learning#caffe

Stars4.2k

Forks1.4k

Last commit1 year ago

hlocPython

A modular Python toolbox for state-of-the-art 6-DoF visual localization using hierarchical image retrieval and feature matching.

#pose-estimation#image-retrieval#visual-localization

Stars4.2k

Forks767

Last commit7 months ago

deepchecksPython

An open-source solution for continuous validation of machine learning models and data, from research to production.

#data-testing#ml-validation#python-library

Stars4.0k

Forks302

Last commit6 months ago

jsQR RTypeScript

A pure JavaScript library for reading QR codes from raw image data in browsers and Node.js.

#qr-parsing-library#qr-code#qr-scanner

Stars4.0k

Forks615

Last commit2 months ago

Awesome Action Recognition

A curated list of resources for action recognition, video understanding, object detection, and pose estimation in computer vision.

#pose-estimation#video-understanding#video-processing

Stars4.0k

Forks718

Last commit3 years ago

awesome-satellite-imagery-datasets

A curated list of satellite and aerial imagery datasets with annotations for computer vision and deep learning tasks.

#annotation#instance-segmentation#aerial-imagery

Stars3.9k

Forks668

Last commit4 years ago

ScenicPython

A JAX library for rapid prototyping of large-scale attention-based vision models across images, video, audio, and multimodal data.

#attention#model-training#jax

Stars3.8k

Forks479

Last commit2 days ago

libfreenectC

A cross-platform userspace driver for the Microsoft Kinect, providing access to RGB/depth images, motors, accelerometer, LED, and audio.

#motion-sensing#kinect-driver#depth-camera

Stars3.8k

Forks1.2k

Last commit1 year ago

lightly - A computer vision framework for self-supervised learningPython

A Python library for self-supervised learning on images, providing a modular PyTorch-like framework with support for modern SSL models.

#hacktoberfest#deep-learning#contributions-welcome

A procedural Blender pipeline for generating photorealistic training images for computer vision and machine learning.

#camera-positions#blender-pipeline#procedural-generation

Stars3.6k

Forks516

Last commit6 months ago

Awesome Visual Transformer

A curated collection of research papers and resources on Vision Transformers (ViT) for computer vision tasks.

#transformer#literature-review#visual-transformer

Stars3.6k

Forks404

Last commit1 year ago

Colornet - Neural Network to colorize grayscale imagesPython

A neural network that automatically adds color to grayscale images using deep learning techniques.

#neural-network#deep-learning#grayscale-to-color

A pure JavaScript OCR engine compiled from Ocrad via Emscripten for client-side text recognition in the browser.

#text-extraction#browser-ocr#webassembly

Stars3.5k

Forks380

Last commit5 years ago

pytrackingPython

A PyTorch-based framework for visual object tracking and video object segmentation, featuring implementations of state-of-the-art trackers like TaMOs, RTS, and DiMP.

#video-object-segmentation#model-training#deep-learning

Implementation of SRGAN for photo-realistic single image super-resolution using generative adversarial networks.

#vgg19#image-enhancement#super-resolution

Stars3.5k

Forks812

Last commit2 years ago

AliceVisionC++

An open-source photogrammetric computer vision framework for 3D reconstruction and camera tracking from photographs and videos.

#hdri-image#alicevision#production-pipeline

Automatic and interactive image colorization using deep neural networks, with PyTorch models for ECCV 2016 and SIGGRAPH 2017 papers.

#automatic-colorization#deep-learning#neural-networks

Stars3.5k

Forks923

Last commit2 years ago

G2O: General framework for graph optomizationC++

An open-source C++ framework for optimizing graph-based nonlinear error functions, widely used in robotics and computer vision.

#robotics#nonlinear-least-squares#open-source-framework

Stars3.4k

Forks1.2k

Last commit

realsense-ros:ros2-branchPython

A ROS2 wrapper for Intel RealSense cameras that provides depth, color, and IMU data as ROS topics and services.

#robotics#pointcloud#intel-realsense

Stars3.4k

Forks2.0k

Last commit3 days ago

Catalyst: High-level utils for PyTorch DL & RL research. It was developed with a focus on reproducibility, fast experimentation and code/ideas reusingPython

A PyTorch framework for deep learning research and development, focusing on reproducibility and rapid experimentation.

#model-training#deep-learning#automl

A hybrid Python/C++ Visual SLAM pipeline supporting monocular, stereo, and RGB-D cameras with modern features, loop closure, and dense reconstruction.

#robotics#visual-slam#global-features

Stars3.4k

Forks533

Last commit5 days ago