An elegant PyTorch-based deep reinforcement learning library with modular APIs for both research and application development.
Tianshou is a deep reinforcement learning library built on PyTorch that provides a modular framework for developing, training, and evaluating RL agents. It addresses the complexity and inflexibility common in RL codebases by offering user-friendly high-level APIs for applications alongside hackable low-level interfaces for algorithm research, and it supports a wide range of algorithms from online DQN to offline CQL.
Reinforcement learning researchers seeking a flexible, type-safe library for algorithm development, and practitioners needing a high-performance, easy-to-use toolkit for applying RL to custom environments.
Developers choose Tianshou for its clean separation between algorithms and policies, extensive algorithm coverage, and dual API design that balances ease of use with research flexibility, all while maintaining high software engineering standards with thorough testing and documentation.
Offers both a high-level API for easy application development and a procedural API for hackable algorithm research, as demonstrated in the quick start examples with CartPole.
Implements over 30 state-of-the-art RL algorithms including DQN variants, PPO, SAC, and offline methods like CQL, covering online, offline, and multi-agent scenarios.
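The common core of the DQN family listed above is the one-step temporal-difference target. A minimal sketch in plain Python, purely illustrative (the function name and argument layout are not Tianshou's API):

```python
# Toy sketch of the one-step DQN target shared by DQN and its variants.
# Pure Python, no library code; names here are illustrative only.
def dqn_target(reward, done, next_q_values, gamma=0.99):
    """Return r + gamma * max_a' Q(s', a'); no bootstrap on terminal states."""
    bootstrap = 0.0 if done else gamma * max(next_q_values)
    return reward + bootstrap
```

Variants such as Double DQN change how `next_q_values` is produced (e.g. selecting the action with the online network but evaluating it with the target network) while keeping this target structure.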
Supports vectorized environments and EnvPool integration for accelerated training, with numba-optimized routines for experience replay and GAE computation to keep the training hot paths fast.
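For context on what the numba-accelerated GAE routine computes, here is a minimal pure-Python sketch of Generalized Advantage Estimation for a single episode; the function name and argument layout are my own, not Tianshou's API, and termination/truncation handling is omitted:

```python
# Illustrative GAE sketch (single episode, no termination handling).
# Not Tianshou's implementation; names and signature are hypothetical.
def gae(rewards, values, next_value, gamma=0.99, lam=0.95):
    """Compute advantages by accumulating discounted TD residuals backwards."""
    advantages = [0.0] * len(rewards)
    acc = 0.0
    for t in reversed(range(len(rewards))):
        v_next = next_value if t == len(rewards) - 1 else values[t + 1]
        delta = rewards[t] + gamma * v_next - values[t]  # TD residual
        acc = delta + gamma * lam * acc
        advantages[t] = acc
    return advantages
```

The backward loop is exactly the kind of sequential, per-step recurrence that benefits from numba JIT compilation rather than vectorization.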
Maintains rigorous testing with full agent training procedures, type hints, and comprehensive documentation, as highlighted in the comparison table with other RL platforms.
Version 2 is a complete overhaul that is not backwards compatible, requiring migration efforts for existing projects, as explicitly noted in the README's change log warning.
Requires Poetry to install the full feature set and to manage extras for environment families such as MuJoCo or Atari, which is more cumbersome than a plain pip-based setup.
The procedural API demands a deeper understanding of RL concepts and PyTorch, making it less accessible to newcomers despite the high-level API's ease of use.
Multi-agent RL features are labeled as experimental in the documentation, meaning they may be less stable or well-documented compared to core algorithms.