A comprehensive collection of PyTorch image models, layers, utilities, and training scripts for computer vision research and applications.
Official JAX/Flax implementation of Vision Transformer (ViT) and MLP-Mixer for image recognition, with pre-trained models.
A JAX library for rapid prototyping of large-scale attention-based vision models across images, video, audio, and multimodal data.
A curated collection of research papers and resources on Vision Transformers (ViT) for computer vision tasks.
A vision transformer foundation model pre-trained on over 200 million pathology images for computational pathology tasks.
A deep learning model based on the Vision Transformer for automated instance segmentation and classification of cell nuclei in histopathology images.
A vision transformer architecture that aggregates nested local transformers on image blocks for better accuracy, data efficiency, and convergence.