Question 1

How do I add a custom big data tool to Ferry?

Accepted Answer

You need to create a Dockerfile for the tool and possibly a configuration module. The README suggests contributing by creating Dockerfiles or hacking Ferry for new backends, but it requires effort and familiarity with Docker and Python.

Question 2

Ferry vs Docker Compose for big data clusters?

Accepted Answer

Ferry is specialized for big data stacks with pre-built configurations, simplifying common setups like Hadoop and Spark. Docker Compose is more general-purpose and might require more manual configuration for complex big data workflows.

Question 3

Can I use Ferry for production deployments?

Accepted Answer

Ferry is geared towards development and testing, as stated in the philosophy. For production, you might need to migrate to managed services or manual setups due to potential limitations in scalability, support, and outdated software versions.

Question 4

How to share my Ferry cluster with a teammate?

Accepted Answer

Share the YAML configuration files and Dockerfiles. Ferry facilitates this by using Dockerfiles for replication, allowing others to recreate the same environment locally or on their infrastructure with minimal setup.

Question 5

What versions of Hadoop and Spark does Ferry support?

Accepted Answer

According to the README, Ferry supports Hadoop 2.5.1 and Spark 1.1.0, which are specific, older versions. Check the documentation for any updates, but it may not include the latest releases.

Question 6

How to install Ferry on a Mac or Windows?

Accepted Answer

Ferry is a Python app that requires Docker. On Mac or Windows, install Docker Desktop and use pip to install Ferry, but detailed OS-specific instructions might be sparse in the provided documentation link.

Question 7

Does Ferry work with Kubernetes?

Accepted Answer

Ferry uses Docker directly and is not integrated with Kubernetes. If you need orchestration with Kubernetes, you'll need to look for other tools or manually adapt the configurations, as Ferry focuses on local and cloud VM deployments.

ferry

What is ferry?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions