Question 1

How do I copy a tensor from CPU to GPU in cutorch?

Accepted Answer

Call the :cuda() method on the CPU tensor, e.g. dest = src:cuda(). The new tensor is allocated on the current GPU device, which you select with cutorch.setDevice.
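A minimal sketch of the copy, assuming a working Torch7 install with cutorch and at least one CUDA device. The tensor shape and the variable names are illustrative only.

```lua
require 'cutorch'

-- cutorch device IDs are 1-based: 1 selects the first GPU.
cutorch.setDevice(1)

-- A tensor in host (CPU) memory.
local src = torch.FloatTensor(4, 4):fill(1)

-- :cuda() returns a new tensor on the current GPU; src itself is unchanged.
local dest = src:cuda()
print(dest:type())

-- :float() copies the data back to a CPU tensor when you need it on the host.
local back = dest:float()
```

Because :cuda() returns a copy, later changes to src are not reflected in dest; call :cuda() again (or use dest:copy(src)) to refresh the GPU copy.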

Question 2

Cutorch vs PyTorch for GPU acceleration: which is better?

Accepted Answer

Cutorch is specific to Torch7 (Lua) and ideal for legacy projects, while PyTorch (Python) offers a more modern, actively maintained ecosystem. Choose cutorch only if you're deeply invested in the Torch7 framework.

Question 3

How to enable multi-GPU support in cutorch?

Accepted Answer

Use functions like cutorch.setDevice to switch between GPUs and cutorch.withDevice for scoped operations. Peer-to-peer access can be enabled with cutorch.setPeerToPeerAccess for efficient inter-GPU communication.
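The calls above might be combined roughly as follows. This is a sketch assuming cutorch is installed; the shards table and the tensor size are made up for illustration, and the peer-to-peer part only runs when at least two GPUs are present.

```lua
require 'cutorch'

local n = cutorch.getDeviceCount()

-- Allocate one tensor per GPU. withDevice runs the function with the given
-- device selected and restores the previously selected device afterwards.
local shards = {}
for dev = 1, n do
  cutorch.withDevice(dev, function()
    shards[dev] = torch.CudaTensor(1024):fill(dev)
  end)
end

-- Ask for direct access from device 1 to memory on device 2.
-- Assumption: the hardware supports peer-to-peer between these two GPUs;
-- on other hardware this call may raise an error.
if n >= 2 then
  cutorch.setPeerToPeerAccess(1, 2, true)
  print(cutorch.getPeerToPeerAccess(1, 2))
end
```

Using withDevice rather than bare setDevice calls keeps the device selection scoped, so code that runs after the loop is not affected by whichever GPU was touched last.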

Question 4

What are the main limitations of cutorch compared to CPU tensors?

Accepted Answer

Non-float CUDA tensor types support only a subset of the tensor operations available on the CPU, and a few APIs behave differently from their CPU counterparts (e.g., the return types of min/max). The README tracks the missing features in issue #70.

Question 5

Is cutorch still actively maintained?

Accepted Answer

Cutorch is part of the older Torch7 ecosystem, which has largely been superseded by PyTorch. It still works, but expect few updates and sparse community support.

Question 6

How to manage memory allocation to avoid slowdowns?

Accepted Answer

Set the environment variable THC_CACHING_ALLOCATOR=1 to enable the caching allocator. It reuses memory allocations instead of freeing them, which avoids costly cudaFree calls that synchronize the CPU with the GPU, as described in the memory management section.
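The variable has to be present in the environment of the process, for example by launching with THC_CACHING_ALLOCATOR=1 th train.lua (train.lua is a hypothetical script name). A small guard like the one below, written in plain Lua, can warn when a script was started without it; the helper name is an illustration and not part of cutorch.

```lua
-- Hypothetical helper: reports whether the caching allocator was requested
-- through the environment. It does not inspect cutorch's internal state.
local function cachingAllocatorRequested()
  return os.getenv('THC_CACHING_ALLOCATOR') == '1'
end

if not cachingAllocatorRequested() then
  io.stderr:write('warning: THC_CACHING_ALLOCATOR=1 is not set; ' ..
                  'GPU allocations will not be cached\n')
end
```

Place the check at the top of the script so the warning shows up before any long-running work starts.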
