Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.


Data Streaming

33 projects


Debezium · Java

A low-latency platform for change data capture (CDC) that streams row-level changes from databases to applications.

#database #event-driven-architecture #cqrs
Stars: 12.8k
Forks: 3.0k
Last commit: 3 days ago

AutoMQ · Java

A diskless Kafka alternative that runs on S3, offering 10x cost savings, autoscaling in seconds, and single-digit ms latency.

#diskless #kafka-alternative #message-queue
Stars: 10.0k
Forks: 723
Last commit: 13 hours ago

hub · C++

Deep Lake is a multimodal data lake and vector store optimized for AI, enabling scalable data management, retrieval, and training for LLM and deep learning applications.

#ai #postgres #data-versioning
Stars: 9.2k
Forks: 710
Last commit: 18 days ago

kcat · C

A lightweight command-line tool for producing, consuming, and inspecting Apache Kafka messages, similar to netcat for Kafka.

#devops #message-queue #command-line-tool
Stars: 5.8k
Forks: 502
Last commit: 1 year ago

fluvio · Rust

A lean distributed data streaming engine and stream processing framework written in Rust for building responsive data-intensive applications.

#stream-processing #event-driven #webassembly
Stars: 5.2k
Forks: 528
Last commit: 9 days ago

Maxwell's daemon · Java

A MySQL change data capture daemon that streams database changes as JSON to Kafka, Kinesis, and other platforms.

#change-data-capture #database-replication #kafka
Stars: 4.3k
Forks: 1.0k
Last commit: 15 days ago

Apache Heron (incubating) · Java

Apache Heron is a real-time, distributed, fault-tolerant stream processing engine developed by Twitter.

#stream-processing #real-time-analytics #distributed-systems
Stars: 3.6k
Forks: 583
Last commit: 3 years ago

Streaming

A curated list of awesome streaming frameworks, applications, readings, and resources for stream processing.

#stream-processing #message-queue #real-time-analytics
Stars: 3.0k
Forks: 317
Last commit: 21 hours ago

3d-tiles · Batchfile

An open specification for streaming massive heterogeneous 3D geospatial datasets across desktop, web, and mobile applications.

#point-clouds #3d-geospatial #geospatial
Stars: 2.5k
Forks: 488
Last commit: 2 days ago

Apache InLong · Java

A one-stop, full-scenario integration framework for massive data, supporting data ingestion, synchronization, and subscription.

#massive-data-integration #stream-processing #batch-processing
Stars: 1.5k
Forks: 568
Last commit: 5 days ago

metaq · Java

A high-availability, high-performance Java message queue system similar to Apache Kafka with optimizations for production use.

#taobao #high-performance #message-queue
Stars: 1.3k
Forks: 676
Last commit: 6 years ago

goavro · Go

A Go library for encoding and decoding Avro data to and from binary and textual JSON formats.

#schema-evolution #binary-encoding #golang
Stars: 1.1k
Forks: 229
Last commit: 4 months ago

Nakadi · Java

A distributed event bus broker providing a RESTful API abstraction over Kafka-like queues for real-time data streaming.

#event-driven-architecture #distributed-systems #rest-api
Stars: 967
Forks: 294
Last commit: 2 years ago

NDJSON

A specification for delimiting JSON objects with newlines in stream protocols and file storage.

#data-serialization #tcp #file-format
Stars: 830
Forks: 34
Last commit: 3 years ago

PotreeConverter · JavaScript

Generates an octree LOD structure for streaming and real-time rendering of massive point clouds in web browsers and desktop applications.

#lidar #3d-visualization #geospatial
Stars: 802
Forks: 478
Last commit: 5 months ago

xandra · Elixir

A fast, simple, and robust Cassandra/ScyllaDB driver for Elixir with native protocol support.

#elixir #connection-pooling #cassandra-driver
Stars: 428
Forks: 60
Last commit: 4 days ago

Schema Registry UI · JavaScript

A web UI for managing Avro schemas in Confluent Schema Registry, enabling creation, viewing, searching, evolution, and configuration.

#confluent-platform #kafka #docker
Stars: 425
Forks: 112
Last commit: 2 years ago

amazon-kinesis-producer · C++

A Java library for building efficient and reliable producer applications for Amazon Kinesis Data Streams.

#java-library #real-time-processing #producer
Stars: 414
Forks: 343
Last commit: 6 days ago

goroslib · Go

A pure Go library for building ROS 1 client nodes, enabling lightweight cross-platform robotics and data streaming applications.

#robotics #ros-package #ugv
Stars: 367
Forks: 72
Last commit: 1 year ago

walex · Elixir

A simple and reliable Elixir library for capturing Postgres change events (CDC) via logical replication.

#logical-replication #event-driven #elixir
Stars: 362
Forks: 21
Last commit: 11 days ago

trubka · Go

A CLI tool for managing, consuming, and publishing messages to Kafka clusters with protocol buffer support.

#devops #protobuf-parser #message-queue
Stars: 337
Forks: 20
Last commit: 1 year ago

Indexed 3D Scene Layers

An open specification for streaming and distributing large volumes of 3D geographic data across web, mobile, and cloud platforms.

#3d-visualization #3d-geospatial #rest-api
Stars: 336
Forks: 87
Last commit: 1 year ago

amazon-kinesis-scaling-utils · Java

A utility for scaling Amazon Kinesis Streams manually or automatically, similar to EC2 Auto Scaling.

#kinesis #auto-scaling #stream-scaling
Stars: 335
Forks: 84
Last commit: 2 years ago

aws-fluent-plugin-kinesis · Ruby

A Fluentd output plugin for sending log events to Amazon Kinesis Data Streams and Amazon Data Firehose.

#aws-integration #kinesis-producer #observability
Stars: 289
Forks: 96
Last commit: 1 month ago

mongolite · C

A high-performance MongoDB client for R, built on libmongoc and jsonlite, supporting aggregation, indexing, and streaming.

#database-driver #r-package #aggregation
Stars: 288
Forks: 65
Last commit: 1 year ago

Kroxylicious · Java

A snappy open-source proxy for Apache Kafka that enables encryption, multi-tenancy, and schema validation.

#custom-filters #kafka-proxy #proxy
Stars: 285
Forks: 107
Last commit: 13 hours ago

memlog · Go

A lightweight, thread-safe, append-only in-memory log data structure inspired by Kafka, for Go applications.

#log-structured-storage #streaming-api #go-library
Stars: 139
Forks: 7
Last commit: 14 hours ago

hyperpipe · JavaScript

A distributed input/output pipe for streaming data between computers using Hypercore.

#data-transfer #distributed-systems #command-line-tool
Stars: 119
Forks: 15
Last commit: 9 years ago

amazon-kinesis-learning · Java

Completed code for the Amazon Kinesis tutorial on processing real-time stock data using KPL and KCL.

#kinesis #stock-data #real-time-processing
Stars: 111
Forks: 137
Last commit: 1 year ago

dndb · TypeScript

A persistent, embeddable NoSQL database for Deno and TypeScript with a MongoDB-like API.

#datastore #database #database-engine
Stars: 78
Forks: 13
Last commit: 3 years ago

Related Tags

#Kafka (8)
#Distributed Systems (7)
#Message Queue (7)
#Stream Processing (6)
#Apache Kafka (6)
#Json (6)
#Real Time (5)
#Data Pipeline (4)
#Cloud Native (4)
#Java (4)
#Kafka Alternative (4)
#Serverless (4)