Showing 3 of 3 projects
A curated list of resources for action recognition, video understanding, object detection, and pose estimation in computer vision.
An open-source framework for building multimodal AI systems that enable large language models to understand and chat about videos and images.
A video-language understanding framework that treats video narration as vocabulary and videos as long documents for efficient analysis.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.