Question 1

How do I set up Replicating-DeepMind on a local machine?

Accepted Answer

Setup involves configuring a GPU cluster with cuda-convnet2, as per the README. Check the project's Wiki for detailed guides, but be prepared for significant hardware and software dependencies that may require advanced system administration skills.

Question 2

What Atari games can it play and how good are the scores?

Accepted Answer

It can play classic Atari 2600 games from pixel input, but the README states it only learns slightly better than random, so don't expect high performance or competitive scores compared to modern implementations.

Question 3

Is Replicating-DeepMind better than Nathan Sprague's deep_q_rl?

Accepted Answer

The README mentions Sprague's Theano-based implementation performs fairly well. Replicating-DeepMind focuses on reproducing DeepMind's exact setup but lacks RMSprop, so Sprague's version might be more functional and complete for practical use.

Question 4

Does this include all the techniques from the DeepMind paper?

Accepted Answer

No, the README explicitly says RMSprop is missing, which is a critical optimization for deep Q-learning. This makes it an incomplete reproduction, suitable more for educational purposes than accurate replication.

Question 5

Can I use this for my own reinforcement learning projects?

Accepted Answer

Yes, as an open-source codebase, you can modify it for experiments, but be aware of limitations like missing features and slow performance. It's best for learning and tweaking rather than deploying in production environments.

Question 6

What GPU requirements are needed to run this system?

Accepted Answer

It requires a GPU cluster with cuda-convnet2, as noted in the README, meaning substantial computational resources and expertise in GPU programming are necessary for effective operation.

Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind

What is Reproducing the results of "Playing Atari with Deep Reinforcement Learning" by DeepMind?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions