LinkedIn's previous-generation Kafka to HDFS pipeline for batch data ingestion.
Camus is LinkedIn's previous-generation data pipeline: a Hadoop MapReduce job that transfers data from Kafka topics to Hadoop HDFS in batch mode. It makes streaming data available for batch analytics by providing a reliable, scalable ingestion path between the two distributed systems.
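As a sketch of what a run looks like in practice: Camus is submitted to Hadoop like any other MapReduce job, pointed at a properties file that describes the Kafka brokers, topics, and HDFS output paths. `com.linkedin.camus.etl.kafka.CamusJob` and the `-P` flag are the documented entry point; the jar name and file paths below are illustrative placeholders.

```sh
# Submit Camus as an ordinary Hadoop MapReduce job.
# Jar name and properties path are placeholders; adjust to your build.
hadoop jar camus-example-0.1.0-SNAPSHOT-shaded.jar \
  com.linkedin.camus.etl.kafka.CamusJob \
  -P /etc/camus/camus.properties
```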
Data engineers and infrastructure teams who need to move data from Kafka into Hadoop for batch processing, analytics, or data warehousing.
Developers chose Camus for its proven reliability at LinkedIn's scale, its straightforward fit with existing Kafka and Hadoop deployments, and its specialized focus on batch-oriented transfer between the two systems.
Designed and operated by LinkedIn for large-scale data transfer, giving it a proven record of reliability and scalability in production environments.
Directly connects Kafka topics to HDFS with automated partition management and offset tracking, simplifying setup for batch ingestion (see the configuration sketch after this list).
Tailored for scheduled batch processing, making it efficient for analytics workloads that don't require real-time data access.
Writes output as ordinary HDFS files, so existing Hadoop tools and storage infrastructure can process the data without extra plumbing.
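To make the partition-management and offset-tracking points above concrete, here is a minimal `camus.properties` sketch. The property keys follow those shipped in Camus's example configuration; the broker addresses, paths, topic names, and class choices are placeholder assumptions to adapt to your environment.

```properties
# Minimal, illustrative camus.properties (all values are placeholders).
camus.job.name=camus-hourly-ingest

# Kafka source: brokers and topic selection.
kafka.brokers=broker1:9092,broker2:9092
kafka.whitelist.topics=page_views,ad_clicks

# HDFS layout: final data, per-run working dirs, and the execution
# history where Camus persists consumed offsets between runs.
etl.destination.path=/data/camus/topics
etl.execution.base.path=/data/camus/exec
etl.execution.history.path=/data/camus/exec/history

# How incoming messages are decoded and written out.
camus.message.decoder.class=com.linkedin.camus.etl.kafka.coders.JsonStringMessageDecoder
etl.record.writer.provider.class=com.linkedin.camus.etl.kafka.common.StringRecordWriterProvider

# Time-based output partitioning (hourly).
etl.output.file.time.partition.mins=60
```

Scheduling is then simply a matter of invoking the CamusJob command shown earlier from cron or a workflow scheduler such as Oozie or Azkaban at the desired cadence; each run resumes from the offsets recorded in the execution history path.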
LinkedIn has phased out Camus in favor of Gobblin (now Apache Gobblin), so there will be no future updates, bug fixes, or official support.
Cannot handle real-time streaming data, which is a significant drawback for modern data pipelines that often require low latency.
Only supports output to Hadoop HDFS, making it inflexible for environments moving to cloud storage or other file systems.
Users are forced to migrate to Gobblin or other tools, adding complexity and potential downtime to existing pipelines.