A JAX library for rapid prototyping of large-scale attention-based vision models across images, video, audio, and multimodal data.
A curated collection of research papers and resources on Vision Transformers (ViT) for computer vision tasks.
A deep reinforcement learning framework for crowd-aware robot navigation that uses attention mechanisms to model human-robot and human-human interactions.