Showing 20 of 20 projects
Open Source Computer Vision Library providing real-time image processing and AI capabilities.
An open-source library with over 2500 optimized algorithms for real-time computer vision and machine learning.
An open-source Automatic License Plate Recognition library written in C++ with bindings for multiple programming languages.
A fully convolutional neural network for real-time instance segmentation, achieving high speed and accuracy on COCO.
A PyTorch-based research platform implementing state-of-the-art single object tracking algorithms like SiamRPN and SiamMask.
OpenCV bindings for Node.js enabling real-time computer vision applications in JavaScript.
A curated list of resources for action recognition, video understanding, object detection, and pose estimation in computer vision.
An open-source framework for building multimodal AI systems that enable large language models to understand and chat about videos and images.
PyTorch implementation of FlowNet 2.0 for optical flow estimation using deep neural networks.
A JavaScript library for parsing, segmenting, and extracting samples from MP4 files in the browser and Node.js.
A CVPR 2018 algorithm for efficient multi-person pose estimation and tracking in videos, ranking first in the ICCV 2017 PoseTrack challenge.
A curated collection of papers, code, and datasets for deep learning and multimodal learning in video analysis.
A benchmark and toolkit for discovering, detecting, recognizing, and tracking UAVs in the wild using RGB and thermal infrared video.
A video-language understanding framework that treats video narration as vocabulary and videos as long documents for efficient analysis.
Open-source quality control tool for analyzing digitized video files through audiovisual analytics and filtering.
A command-line utility that uses convolutional neural networks to search and filter videos based on objects and places that appear in them.
Automated UI testing framework for set-top boxes and smart TVs using infrared commands and video analysis.
A distributed video processing platform built on Apache Storm with OpenCV integration for large-scale computer vision pipelines.
A TensorFlow implementation of hierarchical attentive recurrent neural networks for single object tracking in videos.
An application that uses IBM Watson AI services and Cloud Functions to analyze videos, extracting visual and audio insights for search and categorization.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.