Question 1

How does Databus compare to Debezium for change data capture?

Accepted Answer

Databus is a CDC system that LinkedIn built and later open-sourced. It targets Oracle and MySQL sources and emphasizes consistency with the source database. Debezium is an open-source CDC platform built on Kafka Connect, with connectors for a broader range of databases and an active community. Databus was designed around LinkedIn's large-scale deployments, whereas Debezium is generally easier to integrate and receives more frequent updates.

Question 2

How to set up Databus with MySQL for real-time data streaming?

Accepted Answer

To set up Databus with MySQL, configure the relay to read MySQL's binary logs, using the example configurations as a starting point. The README only walks through an Oracle example, so refer to the wiki for MySQL-specific details. These may involve custom log-mining adapters and careful tuning.

Question 3

Is Databus still actively maintained and updated?

Accepted Answer

The GitHub repository shows some activity, but the design dates back to 2012 and the documentation points to a wiki that may be outdated. Databus has been used in production at LinkedIn, yet community-driven updates are limited compared to newer alternatives.

Question 4

Can Databus handle schema changes in source databases?

Accepted Answer

Databus captures data changes from the source database's transaction log, but how schema changes are handled depends on the implementation. The README does not cover this, so you may need custom logic in downstream applications to manage schema evolution, which adds complexity.
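One common way to keep that downstream logic small is a "tolerant reader": ignore columns the consumer does not know about and supply defaults for columns that older events lack. The sketch below is plain Java and does not use any Databus API. The `Person` type, the field names, the `"unknown"` default, and the map-shaped payload are all hypothetical stand-ins for whatever your consumer actually receives.

```java
import java.util.HashMap;
import java.util.Map;

class SchemaTolerantConsumer {
    /** Hypothetical downstream view of a row; not a Databus type. */
    static final class Person {
        final long id;
        final String name;
        final String email; // assumed to be added in a later schema version

        Person(long id, String name, String email) {
            this.id = id;
            this.name = name;
            this.email = email;
        }
    }

    /** Decodes a change-event payload, tolerating added and missing columns. */
    static Person decode(Map<String, Object> payload) {
        Object id = payload.get("id");
        if (!(id instanceof Number)) {
            // The key column is the one field this consumer cannot default.
            throw new IllegalArgumentException("payload has no numeric 'id' field");
        }
        Object name = payload.get("name");
        Object email = payload.get("email");
        return new Person(
            ((Number) id).longValue(),
            name == null ? "" : name.toString(),
            email == null ? "unknown" : email.toString()); // default for old events
    }

    public static void main(String[] args) {
        // Event written before the hypothetical 'email' column existed.
        Map<String, Object> oldRow = new HashMap<>();
        oldRow.put("id", 1L);
        oldRow.put("name", "Ada");

        // Event written after the change, with one column this consumer ignores.
        Map<String, Object> newRow = new HashMap<>(oldRow);
        newRow.put("email", "ada@example.com");
        newRow.put("nickname", "al");

        System.out.println(decode(oldRow).email);
        System.out.println(decode(newRow).email);
    }
}
```

This only covers additive changes. Renamed or retyped columns still need an explicit mapping step in the consumer.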

Question 5

What are the scalability limits of Databus for high-throughput systems?

Accepted Answer

According to the README, Databus handles thousands of events per second per server with low latency. Scaling beyond a single server requires a distributed deployment and tuning. For exact limits, refer to the 2012 ACM paper, keeping in mind that real-world performance depends on your infrastructure and configuration.

Question 6

How to subscribe to specific tables or data streams in Databus?

Accepted Answer

Databus's subscription functionality lets a client subscribe to specific tables or views, as demonstrated in the PersonClientMain example. Subscriptions are defined through the client API, so that each consumer receives only the change events it needs and can route them as required.
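To make the idea of per-source subscriptions concrete, here is a minimal, self-contained sketch of the pattern. It is not the Databus client API: `SubscriptionSketch`, `ChangeEvent`, `subscribe`, `dispatch`, and the source names `example.Person` and `example.Order` are hypothetical names chosen for illustration. For the real registration calls, follow PersonClientMain in the repository.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

class SubscriptionSketch {
    /** Hypothetical change event: the source (table or view) name plus a row key. */
    static final class ChangeEvent {
        final String source;
        final String key;

        ChangeEvent(String source, String key) {
            this.source = source;
            this.key = key;
        }
    }

    // Listeners grouped by the source name they subscribed to.
    private final Map<String, List<Consumer<ChangeEvent>>> listeners = new HashMap<>();

    /** Registers a listener for a single source. */
    void subscribe(String source, Consumer<ChangeEvent> listener) {
        listeners.computeIfAbsent(source, s -> new ArrayList<>()).add(listener);
    }

    /** Delivers an event to the listeners of its source; returns how many were invoked. */
    int dispatch(ChangeEvent event) {
        List<Consumer<ChangeEvent>> targets =
            listeners.getOrDefault(event.source, Collections.emptyList());
        for (Consumer<ChangeEvent> listener : targets) {
            listener.accept(event);
        }
        return targets.size();
    }

    public static void main(String[] args) {
        SubscriptionSketch bus = new SubscriptionSketch();
        List<String> seen = new ArrayList<>();
        bus.subscribe("example.Person", e -> seen.add(e.key));

        bus.dispatch(new ChangeEvent("example.Person", "42")); // delivered
        bus.dispatch(new ChangeEvent("example.Order", "7"));   // no subscriber, dropped
        System.out.println(seen);
    }
}
```

Keying listeners by source name keeps filtering out of the consumer code: a consumer never sees events from sources it did not ask for.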
