How do I install the imitation library?

Install via pip with 'pip install imitation' for the PyPI release. For development, clone the GitHub repo and use 'pip install -e .[dev]' with Python 3.8+ and gymnasium dependencies.

What's the difference between GAIL and AIRL in imitation?

GAIL directly imitates expert behavior using a discriminator, while AIRL infers a reward function adversarially. Imitation implements both with support for discrete and continuous spaces, allowing choice based on reward interpretability needs.

How to train an agent using Behavioral Cloning with imitation?

Load expert demonstrations (e.g., from gymnasium rollouts) and use the BC algorithm from the library. The quickstart.py example shows this process with minimal code for environments like CartPole.

Does imitation work with TensorFlow or older gym?

No, imitation is built on PyTorch and only supports gymnasium, not TensorFlow or the deprecated gym API. This limits compatibility with projects using those frameworks.

Can I use imitation for real-time or embedded systems?

It's designed for research and benchmarking, with a focus on reproducibility via Sacred. While code is production-ready, it may require optimization for low-latency or resource-constrained deployments.

Imitation vs stable-baselines3: which should I use?

Use imitation for tasks specifically involving demonstrations or human preferences, as it offers specialized algorithms like GAIL and AIRL. For general reinforcement learning without imitation, stable-baselines3 provides a broader set of methods.

Open-Awesome

Imitation

MITPythonv1.0.1

Clean PyTorch implementations of imitation and reward learning algorithms for reinforcement learning.

Visit Website GitHub

1.8k stars301 forks0 contributors

What is Imitation?

Imitation is a Python library that provides clean, production-ready implementations of imitation learning and reward learning algorithms. It solves the problem of training reinforcement learning agents efficiently by leveraging expert demonstrations, human preferences, or inferred reward functions, rather than relying solely on trial-and-error. The library includes algorithms like Behavioral Cloning, GAIL, AIRL, and Inverse Reinforcement Learning, all built with PyTorch.

Target Audience

Reinforcement learning researchers and practitioners who need reliable baselines for imitation learning, as well as developers working on AI systems that learn from human demonstrations or preferences, particularly in robotics and game AI.

Value Proposition

Developers choose Imitation for its high-quality, well-documented implementations that are rigorously tested and benchmarked, offering a standardized toolkit that reduces implementation errors and accelerates research. Its modular design and support for both discrete and continuous environments make it versatile for various applications.

Overview

Clean PyTorch implementations of imitation and reward learning algorithms

Use Cases

Best For

Training RL agents from expert demonstrations without hand-crafted rewards
Implementing and benchmarking state-of-the-art imitation learning algorithms
Research projects requiring reproducible baselines for imitation or reward learning
Building AI systems that learn from human preferences or comparisons
Educational purposes to understand imitation learning algorithms in depth
Developing robotics controllers using demonstration data

Not Ideal For

Teams using TensorFlow as their primary deep learning framework
Projects still reliant on the deprecated OpenAI Gym API
Applications requiring Maximum Causal Entropy IRL or Soft Q Imitation Learning with continuous action spaces
Developers needing a lightweight, no-dependency setup for rapid prototyping

Pros & Cons

Pros

Comprehensive Algorithm Suite

Implements key algorithms like GAIL, AIRL, and preference-based learning, all documented with benchmark results for reliable performance comparison.

High Code Quality

Emphasizes clean, modular, and well-tested code, ensuring stability for research and practical applications as stated in the philosophy.

Extensive Documentation

Features thorough API docs, tutorials, and benchmark summaries on ReadTheDocs, making it accessible for varied use cases.

Unified API Design

Offers consistent interfaces across algorithms, simplifying experimentation and integration with gymnasium environments.

Cons

Limited Continuous Support

Algorithms like MCE IRL and SQIL do not support continuous action spaces, restricting their utility in environments like robotics simulations.

Dependency on Gymnasium

Only compatible with the newer gymnasium API, forcing migration from the older gym, which can be a barrier for existing projects.

Complex CLI Setup

Uses Sacred for configuration, adding overhead for users unfamiliar with it or those preferring simpler script-based workflows.

Frequently Asked Questions

Related Projects

Stable Baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Stars13,395

Forks2,141

Last commit28 days ago

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Stars12,010

Forks1,357

Last commit5 days ago

Dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Stars10,879

Forks1,393

Last commit2 months ago

Tianshou

An elegant PyTorch deep reinforcement learning library.

Stars10,783

Forks1,319

Last commit2 months ago

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub

Imitation

MITPythonv1.0.1

Clean PyTorch implementations of imitation and reward learning algorithms for reinforcement learning.

Visit Website GitHub

1.8k stars301 forks0 contributors

What is Imitation?

Target Audience

Value Proposition

Overview

Clean PyTorch implementations of imitation and reward learning algorithms

Use Cases

Best For

Training RL agents from expert demonstrations without hand-crafted rewards
Implementing and benchmarking state-of-the-art imitation learning algorithms
Research projects requiring reproducible baselines for imitation or reward learning
Building AI systems that learn from human preferences or comparisons
Educational purposes to understand imitation learning algorithms in depth
Developing robotics controllers using demonstration data

Not Ideal For

Teams using TensorFlow as their primary deep learning framework
Projects still reliant on the deprecated OpenAI Gym API
Applications requiring Maximum Causal Entropy IRL or Soft Q Imitation Learning with continuous action spaces
Developers needing a lightweight, no-dependency setup for rapid prototyping

Pros & Cons

Pros

Comprehensive Algorithm Suite

Implements key algorithms like GAIL, AIRL, and preference-based learning, all documented with benchmark results for reliable performance comparison.

High Code Quality

Emphasizes clean, modular, and well-tested code, ensuring stability for research and practical applications as stated in the philosophy.

Extensive Documentation

Features thorough API docs, tutorials, and benchmark summaries on ReadTheDocs, making it accessible for varied use cases.

Unified API Design

Offers consistent interfaces across algorithms, simplifying experimentation and integration with gymnasium environments.

Cons

Limited Continuous Support

Algorithms like MCE IRL and SQIL do not support continuous action spaces, restricting their utility in environments like robotics simulations.

Dependency on Gymnasium

Only compatible with the newer gymnasium API, forcing migration from the older gym, which can be a barrier for existing projects.

Complex CLI Setup

Uses Sacred for configuration, adding overhead for users unfamiliar with it or those preferring simpler script-based workflows.

Frequently Asked Questions

Related Projects

Stable Baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Stars13,395

Forks2,141

Last commit28 days ago

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Stars12,010

Forks1,357

Last commit5 days ago

Dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Stars10,879

Forks1,393

Last commit2 months ago

Tianshou

An elegant PyTorch deep reinforcement learning library.

Stars10,783

Forks1,319

Last commit2 months ago

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub