Showing 14 of 14 projects
An open-source framework for building LLM-powered applications with data ingestion, indexing, and retrieval capabilities.
A scalable time series database optimized for real-time metrics, events, and analytics with fast query response.
A realtime distributed messaging platform designed to operate at scale, handling billions of messages per day.
An open-source time-series database for high-speed ingestion and low-latency SQL queries.
Enterprise-grade event streaming platform that continuously ingests, processes, and serves real-time data with Apache Iceberg™ integration.
An enterprise-grade event streaming platform that ingests, processes, and manages real-time event data with PostgreSQL compatibility and Apache Iceberg™ integration.
A CLI tool to copy data between any databases and platforms with a single command, no code required.
Build concurrent, multi-stage data ingestion and processing pipelines with Elixir, supporting back-pressure, batching, and fault tolerance.
A distributed service for efficiently collecting, aggregating, and moving large amounts of log-like data.
A real-time distributed analytical database built entirely on bitmaps for low-latency queries on fresh data.
A distributed data integration framework for big data ecosystems, handling ingestion, replication, organization, and lifecycle management for both streaming and batch data.
A distributed data integration framework for big data ecosystems, handling ingestion, replication, organization, and lifecycle management for both streaming and batch data.
A unified data pipeline tool for ingestion, transformation with SQL/Python/R, and data quality checks across major platforms.
A high-performance CSV ingestion and generation library for Ruby with C acceleration, designed for real-world data with intelligent defaults.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.