Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Machine Learning
  3. detectron2

detectron2

Apache-2.0Pythonv0.6

A PyTorch-based platform for state-of-the-art object detection, segmentation, and visual recognition tasks.

Visit WebsiteGitHubGitHub
34.3k stars7.9k forks0 contributors

What is detectron2?

Detectron2 is a PyTorch-based library developed by Facebook AI Research for state-of-the-art computer vision tasks, primarily object detection and image segmentation. It provides a modular platform with advanced algorithms like Cascade R-CNN, PointRend, and panoptic segmentation, enabling both research experimentation and production deployment. The library serves as the successor to Detectron and maskrcnn-benchmark, offering improved training speed and extensive model support.

Target Audience

Computer vision researchers, AI engineers, and developers working on visual recognition projects who need a flexible, production-ready toolkit for object detection and segmentation tasks.

Value Proposition

Developers choose Detectron2 for its comprehensive model zoo, modular architecture that supports custom research projects, and seamless export capabilities for deployment. It combines cutting-edge algorithms with practical production features, backed by Facebook AI Research's ongoing development and community support.

Overview

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Use Cases

Best For

  • Training custom object detection models with state-of-the-art architectures
  • Implementing panoptic segmentation for complex scene understanding
  • Building research projects that require modular computer vision components
  • Deploying production vision models via TorchScript or Caffe2 exports
  • Experimenting with transformer-based detection models like ViTDet
  • Developing applications requiring human pose estimation with DensePose

Not Ideal For

  • Projects requiring out-of-the-box vision APIs with minimal setup and no model training
  • Applications deployed on edge devices with limited GPU memory and computational power
  • Teams looking for extensive beginner-friendly documentation and tutorials for simple use cases

Pros & Cons

Pros

Cutting-Edge Algorithms

Supports advanced techniques like panoptic segmentation, DensePose, and transformer-based models such as ViTDet, providing state-of-the-art performance for visual recognition tasks.

Modular Research Platform

Designed as a library to build custom projects, with a flexible architecture that facilitates experimentation and extension, as evidenced by the projects/ directory for research builds.

Seamless Production Export

Models can be easily exported to TorchScript or Caffe2 formats, enabling smooth deployment in production environments, as highlighted in the export flexibility features.

Comprehensive Model Zoo

Offers a wide range of pre-trained models and benchmarks in the Model Zoo, allowing for quick start and comparison without training from scratch.

Faster Training Speed

Optimized implementations lead to improved training efficiency compared to its predecessors, as noted in the benchmarks linked in the README.

Cons

Complex Installation and Dependencies

Requires specific PyTorch and CUDA versions with non-trivial setup, often needing careful configuration or Docker, which can be challenging for newcomers.

High Hardware Demands

Training and running models necessitate powerful GPUs with significant memory, making it unsuitable for resource-constrained setups without access to high-end hardware.

PyTorch Ecosystem Dependence

Tightly integrated with PyTorch, limiting adoption for teams using other frameworks like TensorFlow and creating vendor lock-in for production pipelines.

Frequently Asked Questions

Quick Stats

Stars34,348
Forks7,924
Contributors0
Open Issues470
Last commit16 days ago
CreatedSince 2019

Tags

#deep-learning#image-segmentation#research-platform#model-zoo#computer-vision#object-detection#pytorch

Built With

T
TorchScript
P
PyTorch

Links & Resources

Website

Included in

Machine Learning72.2kRobotic Tooling3.8k
Auto-fetched 1 day ago

Related Projects

face_recognitionface_recognition

The world's simplest facial recognition api for Python and the command line

Stars56,322
Forks13,729
Last commit1 year ago
timmtimm

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Stars36,682
Forks5,146
Last commit1 day ago
OpenposeOpenpose

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

Stars33,990
Forks8,055
Last commit1 year ago
EasyOCREasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Stars29,339
Forks3,558
Last commit4 months ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub