A high-performance, S3-compatible distributed object storage system built in Rust, optimized for data lakes and AI workloads.
An enterprise distributed database ecosystem that enhances heterogeneous databases with sharding, scalability, and security via JDBC and Proxy access layers.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A curated list of awesome big data frameworks, resources, and tools across various categories.
An open-source enterprise data warehouse built in Rust for AI agents, analytics, vector search, and full-text search.
A high-performance Python DataFrame library for lazy out-of-core processing and visualization of billion-row datasets at interactive speeds.
An open data lakehouse platform for incremental data processing with upserts, deletes, and time-travel queries.
.NET for Apache Spark provides high-performance .NET APIs for Apache Spark, enabling C# and F# developers to work with structured and streaming data.
An easy-to-use, self-hosted SQL reporting application for creating interactive business intelligence dashboards.
A federated Big Data orchestration service that simplifies job execution across distributed clusters by abstracting infrastructure complexity.
A Python library for agile data preparation workflows that works with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and PySpark.