Question 1

How to speed up uploading thousands of small files to S3?

Accepted Answer

Use s3-parallel-put with the --processes option to set multiple parallel uploads. It's designed specifically for this by distributing files across processes, reducing overall time compared to sequential tools.

Question 2

s3-parallel-put vs AWS CLI sync: which is better for batch uploads?

Accepted Answer

s3-parallel-put often outperforms AWS CLI for many small files due to its true parallel processing, while AWS CLI might offer more features like incremental sync based on timestamps. However, s3-parallel-put is simpler and faster for raw upload speed.

Question 3

How to resume a failed upload with s3-parallel-put?

Accepted Answer

Run the command with --resume and the log filename from the previous attempt. The tool reads the log to skip already uploaded files, allowing you to pick up where it left off without re-uploading.

Question 4

Does s3-parallel-put work with Python 3?

Accepted Answer

No, it currently depends on Python 2.X, as stated in the dependencies. This limits its use on systems that have migrated to Python 3, requiring potential workarounds or forks.

Question 5

Can s3-parallel-put compress files during upload?

Accepted Answer

Yes, use the --gzip option to compress text files and set the Content-Encoding header. You can customize which file types to compress with --gzip-type, such as for SVG or other specific content types.

Question 6

How to test uploads without actually sending files to S3?

Accepted Answer

Use the --dry-run option along with --limit to simulate the upload process. This prints the actions that would be taken, helping verify paths and settings before executing the transfer.

s3-parallel-put

What is s3-parallel-put?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions