Question 1

How do I run Paxml on NVIDIA GPUs instead of TPUs?

Accepted Answer

Use the NVIDIA Rosetta repository for optimized GPU versions with H100 FP8 support, as the main README notes GPU support is handled separately, requiring installation from that fork rather than the standard PyPI package.

Question 2

What's the difference between Paxml and MaxText for training language models?

Accepted Answer

Paxml is a general ML framework with configurable experiments, while MaxText is a specific implementation for text models; Paxml can import MaxText configs via translation tables, offering more flexibility but added complexity for pure LLM workloads.

Question 3

How to configure a custom neural network layer in Paxml?

Accepted Answer

Define a subclass of base_layer.BaseLayer with HParams as a dataclass, implementing setup and fprop methods, and use Fiddle for shared parameters, as detailed in the layers section with examples like Linear and FeedForward.

Question 4

Why would I choose Paxml over PyTorch for large model training?

Accepted Answer

Paxml excels in achieving high hardware utilization on TPUs with advanced parallelization, making it ideal for cutting-edge efficiency on Google Cloud, but it lacks PyTorch's broad ecosystem and ease of use for general-purpose or small-scale projects.

Question 5

How to handle custom training data formats in Paxml?

Accepted Answer

Implement a custom BaseInput subclass using tf.data or SeqIO, or modify existing inputs by overriding get_next(); the README provides examples for SeqIO and Lingvo inputs with sharding best practices.

Question 6

What should I do if I get dependency errors during Paxml installation?

Accepted Answer

Refer to the requirements.txt file in the specific release branch (e.g., rX.Y.Z) for exact transitive dependency versions, as the README warns about issues in native Cloud TPU VM environments and provides this workaround.

Pax

What is Pax?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions