Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Integration
  3. Debezium (k)

Debezium (k)

Apache-2.0Java

A low-latency platform for change data capture (CDC) that streams row-level changes from databases to applications.

Visit WebsiteGitHubGitHub
12.6k stars2.9k forks0 contributors

What is Debezium (k)?

Debezium is an open-source change data capture (CDC) platform that monitors databases and streams every row-level change as a real-time event. It solves the problem of reliably capturing and reacting to database changes without impacting source systems, providing a unified model across different databases. This enables applications to stay synchronized with database state changes efficiently.

Target Audience

Developers and architects building event-driven microservices, real-time data pipelines, or systems requiring reliable data synchronization across services. It's particularly valuable for teams implementing CQRS, cache invalidation, or migrating from monolithic to distributed architectures.

Value Proposition

Developers choose Debezium for its robust, fault-tolerant architecture built on Kafka, its database-agnostic approach to CDC, and its ability to guarantee exactly-once event delivery. Its unique selling point is providing a production-ready, scalable CDC solution that abstracts away the complexities of database-specific change capture mechanisms.

Overview

Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.

Use Cases

Best For

  • Real-time cache invalidation and synchronization
  • Breaking down monolithic applications into event-driven microservices
  • Building reliable data integration pipelines between heterogeneous systems
  • Implementing CQRS (Command Query Responsibility Separation) architectures
  • Creating audit logs or change history for database transactions
  • Enabling cross-service data sharing without tight coupling

Not Ideal For

  • Projects using databases without built-in CDC support or not covered by Debezium connectors
  • Teams needing simple, infrequent data sync without distributed streaming overhead
  • Environments where deploying and maintaining Kafka is not feasible or desired
  • Applications that only require batch ETL processes, not real-time change streaming

Pros & Cons

Pros

Low-Impact Streaming

Captures and streams committed database changes with minimal impact on source systems, as highlighted in the Key Features for low-latency operation.

Unified Change Model

Provides a single model of all change events across various DBMS, abstracting database-specific intricacies, which simplifies application development per the README.

Fault-Tolerant Architecture

Leverages Kafka and Kafka Connect for durable, replicated, and totally-ordered event logs, ensuring reliability and scalability in distributed setups.

Exactly-Once Semantics

Guarantees reliable event delivery with exactly-once semantics, allowing clients to resume processing from any point, a key feature for data integrity.

Embedded Connector Option

Offers a lightweight embedded engine for applications that prefer direct event consumption without Kafka, adding flexibility as mentioned in the Basic Architecture section.

Cons

Kafka Dependency Complexity

For full fault-tolerant features, it requires Kafka and Kafka Connect, adding operational overhead and infrastructure management that can be daunting for small teams.

Database-Specific Limitations

Performance and capabilities vary per connector; some databases have restricted CDC support or require complex configurations, as hinted in the README's mention of tailored modules.

Setup and Configuration Overhead

Setting up involves configuring connectors, database permissions, and Kafka, which is non-trivial and requires Docker for builds, indicating a steep initial learning curve.

Potential Performance Impact

CDC processes can add load to source databases, and while minimized, monitoring is needed to manage impact in high-throughput scenarios, a trade-off acknowledged in the philosophy.

Frequently Asked Questions

Quick Stats

Stars12,642
Forks2,913
Contributors0
Open Issues0
Last commit2 days ago
CreatedSince 2016

Tags

#database#event-driven-architecture#cqrs#database-monitoring#data-integration#change-data-capture#real-time-sync#kafka#event-streaming#data-streaming#data-pipeline#etl#apache-kafka

Built With

M
Maven
g
git
J
Java
D
Docker
A
Apache Kafka

Links & Resources

Website

Included in

Integration523
Auto-fetched 1 day ago

Related Projects

Maxwell's daemon (.2k)Maxwell's daemon (.2k)

Maxwell's daemon, a mysql-to-json kafka producer

Stars4,243
Forks1,035
Last commit2 months ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub