Showing 11 of 11 projects
A fast open framework for deep learning with a focus on expression, speed, and modularity.
An open-source AI memory tool that records your screen and audio locally, enabling search and automation agents based on your computer activity.
An open-source AI memory tool that captures your screen and audio locally, enabling search and automation agents based on your computer activity.
A Swift library that uses iOS 11 Vision API to automatically detect and crop faces from images.
A ROS wrapper for the AprilTag 3 visual fiducial detector, enabling marker-based pose estimation in robotics applications.
A graphical application for rapidly prototyping and deploying computer vision algorithms, primarily for robotics.
A Swift library for detecting and cropping faces, barcodes, and text in images using iOS 11 Vision API.
A vision transformer architecture that aggregates nested local transformers on image blocks for better accuracy, data efficiency, and convergence.
A Python package providing popular computer vision model architectures built with Equinox for JAX.
An application that uses IBM Watson AI services and Cloud Functions to analyze videos, extracting visual and audio insights for search and categorization.
A macOS app that automatically extracts and annotates facial landmarks from videos for GAN training datasets.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.