Question 1

How to install Sarek for CUDA on Linux?

Accepted Answer

Install CUDA toolkit 12.9+ and driver 575+, clone the repository, and build with 'dune build sarek-cuda'. Verify with 'dune exec -- sarek-device-info' to list CUDA devices, as per the README's prerequisites.

Question 2

Sarek vs Futhark for functional GPU programming?

Accepted Answer

Sarek integrates directly with OCaml's syntax and type system, making it ideal for existing OCaml projects, while Futhark is a standalone language optimized for pure functional arrays. Sarek offers better OCaml interoperability, but Futhark might have more advanced compiler optimizations.

Question 3

Can Sarek run on Apple Silicon Macs?

Accepted Answer

Yes, via the Metal backend for macOS 10.13+ on both Intel and Apple Silicon. Build with 'dune build sarek-metal' and select the Metal device at runtime, as documented in the backend support table.

Question 4

How to debug GPU kernels in Sarek?

Accepted Answer

Use the interpreter backend for CPU-based sequential debugging or enable debug logging with the SAREK_DEBUG environment variable. The native backend also allows parallel CPU execution for testing without GPU drivers.

Question 5

What performance benchmarks exist for Sarek?

Accepted Answer

Sarek includes a benchmark suite covering compute-bound and memory-bound patterns, with results published to an interactive web viewer. Benchmarks show competitive performance, but hand-optimized CUDA code may be faster due to abstraction overhead.

Question 6

Is Sarek suitable for machine learning applications?

Accepted Answer

While possible, Sarek lacks built-in neural network libraries, making it better for custom parallel algorithms. For ML, consider integrating with OCaml frameworks like Owl or using Python bindings, as it's not optimized out-of-the-box for deep learning.

SPOC

What is SPOC?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions