Showing 36 of 72 projects
A Python ETL framework for stream processing, real-time analytics, and building live LLM/RAG pipelines, powered by a scalable Rust engine.
A JVM library for composing asynchronous and event-based programs using observable sequences.
A distributed event streaming platform for building high-performance data pipelines, streaming analytics, and data integration.
A high-performance, end-to-end observability data pipeline for collecting, transforming, and routing logs and metrics.
A high-performance, end-to-end observability data pipeline for collecting, transforming, and routing logs and metrics.
A platform for building highly responsive, resilient, and scalable distributed systems using the actor model.
AutoMQ is a cloud-native, diskless Kafka alternative that uses S3 for storage, offering 10x cost savings, autoscaling in seconds, and single-digit ms latency.
A Go library for building event-driven applications with message streams, supporting various pub/sub implementations.
Enterprise-grade event streaming platform that continuously ingests, processes, and serves real-time data with Apache Iceberg™ integration.
An enterprise-grade event streaming platform that ingests, processes, and manages real-time event data with PostgreSQL compatibility and Apache Iceberg™ integration.
A high-performance, declarative stream processor that connects various sources and sinks with built-in data transformation capabilities.
A high-performance, resilient stream processor that connects various sources and sinks, performs data transformations, and guarantees at-least-once delivery.
A curated list of data engineering tools, frameworks, databases, and resources for software developers.
A curated list of data engineering tools, frameworks, databases, and resources for software developers.
A library for event-driven programming in .NET using a composable, declarative model for processing live data streams.
A library for event-driven programming in .NET using a composable, declarative model with LINQ over observable sequences.
A Node.js wrapper for GraphicsMagick and ImageMagick providing programmatic image processing capabilities.
A unified real-time data platform combining stream processing with a fast data store for instant action on data-in-motion.
A real-time data integration platform that creates and continually updates consistent views of transactional data using SQL.
An open data lakehouse platform for incremental data processing with upserts, deletes, and time-travel queries.
A functional JavaScript utility library with lazy evaluation for optimal performance and memory efficiency.
An event-native database platform engineered for modern software applications and event-driven architectures.
A high-performance multiple regex matching library using hybrid automata for simultaneous pattern matching across data streams.
A CLI tool and dataflow engine that lets you query and join data from multiple databases and file formats using SQL.
A lean distributed data streaming engine and stream processing framework written in Rust for building responsive data-intensive applications.
A distributed data streaming engine with stateful stream processing for building responsive data-intensive applications.
Confluent's high-performance Golang client for Apache Kafka, built on librdkafka with commercial support.
An asynchronous Python framework for building services that interact with Apache Kafka, RabbitMQ, NATS, and Redis event streams.
A glib-like cross-platform C library providing modules for streams, coroutines, containers, algorithms, and more to simplify C development.
A .NET port of the Akka actor model framework for building concurrent, distributed, and fault-tolerant systems in C# and F#.
A distributed stream processing engine in Rust that performs stateful computations on real-time data with subsecond results.
A standard specification for asynchronous stream processing with non-blocking backpressure on the JVM.
Detect binary file types from buffers, streams, or files by checking magic numbers.
A chunk-based JSON parser and generator for Objective-C, enabling stream processing of JSON data.
Apache Heron is a real-time, distributed, fault-tolerant stream processing engine developed by Twitter.
A high-performance distributed map/reduce system with DAG execution, written in Go, supporting standalone or distributed modes.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.