A low-latency platform for change data capture (CDC) that streams row-level changes from databases to applications.
A diskless Kafka alternative that runs on S3, offering 10x cost savings, autoscaling in seconds, and single-digit ms latency.
AutoMQ is a cloud-native, diskless Kafka alternative that uses S3 for storage, offering 10x cost savings, autoscaling in seconds, and single-digit ms latency.
Deep Lake is a multimodal data lake and vector store optimized for AI, enabling scalable data management, retrieval, and training for LLM and deep learning applications.
A lightweight command-line tool for producing, consuming, and inspecting Apache Kafka messages, similar to netcat for Kafka.
A lean distributed data streaming engine and stream processing framework written in Rust for building responsive data-intensive applications.
A distributed data streaming engine with stateful stream processing for building responsive data-intensive applications.
A MySQL change data capture daemon that streams database changes as JSON to Kafka, Kinesis, and other platforms.
Apache Heron is a real-time, distributed, fault-tolerant stream processing engine developed by Twitter.
A curated list of awesome streaming frameworks, applications, readings, and resources for stream processing.
An open specification for streaming massive heterogeneous 3D geospatial datasets across desktop, web, and mobile applications.
A one-stop, full-scenario integration framework for massive data, supporting data ingestion, synchronization, and subscription.
A high-availability, high-performance Java message queue system similar to Apache Kafka with optimizations for production use.
A Go library for encoding and decoding Avro data to and from binary and textual JSON formats.
A distributed event bus broker providing a RESTful API abstraction over Kafka-like queues for real-time data streaming.
A specification for delimiting JSON objects with newlines in stream protocols and file storage.
Generates an octree LOD structure for streaming and real-time rendering of massive point clouds in web browsers and desktop applications.
A fast, simple, and robust Cassandra/ScyllaDB driver for Elixir with native protocol support.
A web UI for managing Avro schemas in Confluent Schema Registry, enabling creation, viewing, searching, evolution, and configuration.
A Java library for building efficient and reliable producer applications for Amazon Kinesis Data Streams.
A pure Go library for building ROS 1 client nodes, enabling lightweight cross-platform robotics and data streaming applications.
A simple and reliable Elixir library for capturing Postgres change events (CDC) via logical replication.
A CLI tool for managing, consuming, and publishing messages to Kafka clusters with protocol buffer support.
An open specification for streaming and distributing large volumes of 3D geographic data across web, mobile, and cloud platforms.
A utility for scaling Amazon Kinesis Streams manually or automatically, similar to EC2 Auto Scaling.
A Fluentd output plugin for sending log events to Amazon Kinesis Data Streams and Amazon Data Firehose.
A high-performance MongoDB client for R, built on libmongoc and jsonlite, supporting aggregation, indexing, and streaming.
A snappy open-source proxy for Apache Kafka that enables encryption, multi-tenancy, and schema validation.
A lightweight, thread-safe, append-only in-memory log data structure inspired by Kafka, for Go applications.
A distributed input/output pipe for streaming data between computers using Hypercore.
Completed code for the Amazon Kinesis tutorial on processing real-time stock data using KPL and KCL.
A persistent, embeddable NoSQL database for Deno and TypeScript with a MongoDB-like API.