Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Machine Learning
  3. Openpose

Openpose

NOASSERTIONC++v1.7.0

Real-time multi-person keypoint detection library for body, face, hands, and foot estimation.

Visit WebsiteGitHubGitHub
34.1k stars8.0k forks0 contributors

What is Openpose?

OpenPose is an open-source library for real-time multi-person 2D and 3D keypoint detection, estimating body, face, hand, and foot keypoints from images and video. It solves the problem of accurately and efficiently detecting human pose in complex, multi-person scenarios, enabling applications in motion analysis, human-computer interaction, and augmented reality.

Target Audience

Researchers, developers, and engineers working in computer vision, robotics, human-computer interaction, and augmented reality who need robust, real-time human pose estimation.

Value Proposition

Developers choose OpenPose for its real-time performance, multi-person detection with constant runtime, and comprehensive keypoint coverage (135 keypoints across body, face, hands, and feet). Its cross-platform support and flexible APIs make it adaptable for both research and production use.

Overview

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

Use Cases

Best For

  • Real-time human pose estimation in video feeds
  • Multi-person tracking in crowded scenes
  • 3D human pose reconstruction from multiple cameras
  • Gesture recognition and hand tracking applications
  • Facial landmark detection for emotion analysis
  • Motion capture and animation from video

Not Ideal For

  • Projects deploying on low-power edge devices without GPU acceleration
  • Applications requiring only single-person pose estimation with minimal computational overhead
  • Teams needing a quick, out-of-the-box solution without complex dependency management
  • Commercial products where licensing costs from FlintBox are prohibitive

Pros & Cons

Pros

Real-Time Multi-Person Detection

Body keypoint estimation runtime is invariant to the number of people, enabling efficient processing in crowded scenes, as demonstrated in the runtime analysis comparison.

Comprehensive Keypoint Coverage

Detects 135 keypoints across body, face, hands, and feet, providing detailed pose data for advanced applications like motion analysis and gesture recognition.

Flexible Input and Output

Supports images, video, webcam, IP cameras, and custom sources, with outputs in JSON, XML, YML, or visual formats, allowing easy integration into various pipelines.

Cross-Platform and Hardware Support

Runs on Ubuntu, Windows, macOS, and Nvidia TX2, with versions for CUDA, OpenCL, and CPU-only, ensuring compatibility across different environments.

Cons

Complex Installation Process

Building from source requires managing dependencies like CUDA and OpenCV, which can be error-prone and time-consuming, as noted in the installation documentation.

Runtime Scaling for Face and Hands

Face and hand keypoint estimation runtime depends on the number of people, unlike body detection, which can degrade performance in scenes with many individuals, as admitted in the features section.

Commercial Licensing Restrictions

Non-commercial use is free, but commercial applications require purchasing a license from FlintBox, adding cost and administrative overhead for business deployments.

Frequently Asked Questions

Quick Stats

Stars34,136
Forks8,047
Contributors0
Open Issues340
Last commit1 year ago
CreatedSince 2017

Tags

#cuda#pose-estimation#opencl#opencv#deep-learning#human-pose-estimation#keypoint-detection#python-api#c-plus-plus#caffe#computer-vision#machine-learning#real-time#cpp

Built With

O
OpenCL
C
CUDA
P
Python
C
C++

Links & Resources

Website

Included in

Machine Learning72.2k
Auto-fetched 23 hours ago

Related Projects

face_recognitionface_recognition

The world's simplest facial recognition api for Python and the command line

Stars56,481
Forks13,719
Last commit1 year ago
timmtimm

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Stars36,866
Forks5,163
Last commit5 days ago
detectron2detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Stars34,531
Forks7,942
Last commit1 day ago
MLXMLX

MLX: An array framework for Apple silicon

Stars26,640
Forks1,859
Last commit23 hours ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub