A curated collection of research papers and resources on Vision Transformers (ViT) for computer vision tasks.
Awesome Visual-Transformer is a curated GitHub repository that collects and organizes academic papers, surveys, and blogs related to Transformer models applied to computer vision. It addresses the challenge of keeping up with the fast-paced research in Vision Transformers by providing a centralized, community-updated resource. The repository spans foundational works, novel architectures, and applications across tasks like image recognition, object detection, and medical imaging.
AI researchers, computer vision practitioners, graduate students, and engineers who need a structured overview of Vision Transformer literature to inform their work or stay current with advancements.
It saves significant time in literature review by aggregating a vast number of relevant papers in one place, often with direct links to code and preprints. Unlike generic paper lists, it is specifically focused on the intersection of Transformers and vision, and is actively maintained by the community for completeness.
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Aggregates hundreds of papers from top conferences like CVPR, ICCV, and arXiv, spanning foundational models to recent advances, as evidenced by the extensive lists from 2021-2023.
Organizes resources by year, conference, and application areas such as detection and segmentation, making navigation efficient for targeted research.
Encourages contributions via issues and pull requests, as stated in the README, helping keep the list current with community input.
Many entries provide direct links to official implementation code, such as for Swin Transformer and DETR, facilitating reproducibility and experimentation.
Lists papers without curation or evaluation, leaving users to manually assess the significance and reliability of each work, which can be time-consuming.
Depends on community submissions, leading to potential gaps or delays in adding the latest research, as updates are not automated or guaranteed.
Provides only titles and links, missing abstracts, key findings, or comparative analyses that would aid in quick understanding and decision-making.