A modular deep reinforcement learning framework in PyTorch for research and application, featuring ready-to-use algorithms and reproducible experiments.
SLM Lab is a modular deep reinforcement learning framework built in PyTorch that provides ready-to-use algorithms and tools for training AI agents. It solves the problem of complex and non-reproducible RL experimentation by offering a structured, configuration-driven approach. The framework is also the companion library for the book "Foundations of Deep Reinforcement Learning."
SLM Lab is aimed at researchers, students, and practitioners in reinforcement learning who need a reproducible and easy-to-configure framework for experimenting with RL algorithms. It is particularly useful for those working with PyTorch and Gymnasium environments.
Developers choose SLM Lab for its modular design, extensive algorithm support, and strong emphasis on reproducibility. Its integration with cloud services and automatic analysis tools reduces the overhead of managing RL experiments, making it efficient for both learning and advanced research.
Includes validated implementations of PPO, SAC, DQN, and others on 70+ environments, reducing implementation time for common RL tasks.
Saves the spec file and git SHA for each run, so experiments can be reproduced exactly, as highlighted in the README's reproducibility feature.
Built-in support for dstack GPU training and HuggingFace result sharing streamlines cloud workflows, as detailed in the Cloud Training section.
Uses JSON spec files to define experiments, so runs can be configured and varied without touching code, making it easy to manage multiple runs, per the Easy Configuration feature.
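As an illustration of the spec-file approach, a minimal DQN-on-CartPole spec might look like the sketch below. The top-level agent/env/body/meta layout follows SLM Lab's documented spec format, but the spec name and individual field values here are illustrative assumptions and vary between versions:

```json
{
  "dqn_cartpole": {
    "agent": [{
      "name": "DQN",
      "algorithm": {"name": "DQN", "gamma": 0.99}
    }],
    "env": [{"name": "CartPole-v1", "max_frame": 10000}],
    "body": {"product": "outer", "num": 1},
    "meta": {"max_session": 1, "max_trial": 1}
  }
}
```

Because the whole experiment lives in one file like this, sweeping a hyperparameter or swapping the environment is a one-line spec edit rather than a code change.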
Major updates, such as the v5.0 migration to Gymnasium, break compatibility with the companion book's code, requiring users to manually check out older versions, as noted in the README warning.
Heavily relies on specific tools like uv, dstack, and HuggingFace, which may not integrate well with existing pipelines or preferred alternatives.
Requires multiple steps for installation, cloud configuration, and dependency management, including uv tooling and .env file setup, as seen in the Quick Start and Cloud Training sections.
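A hypothetical sketch of those setup steps, assuming the kengz/SLM-Lab repository layout; the clone/sync commands and the `.env` variable name are illustrative assumptions, not taken verbatim from the README:

```shell
# Hypothetical setup sketch -- consult the repo's Quick Start and
# Cloud Training sections for the exact commands.
# 1. Clone and install dependencies with uv (shown as comments so this
#    sketch stays self-contained):
#      git clone https://github.com/kengz/SLM-Lab && cd SLM-Lab
#      uv sync
# 2. Cloud training and HuggingFace result sharing read credentials
#    from a .env file in the project root:
cat > .env <<'EOF'
# hypothetical variable name; check the repo for the expected keys
HF_TOKEN=your-huggingface-token
EOF
```

The `.env` file keeps credentials out of the spec files and out of version control, which matters since each run's spec and git SHA are saved for reproducibility.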