Showing 14 of 14 projects
An in-process analytical SQL database management system designed for high-performance data analysis.
A scalable time series database optimized for real-time metrics, events, and analytics with fast query response.
A scalable time series database optimized for real-time metrics, events, and analytics with fast query response.
An open-source storage framework that enables building a Lakehouse architecture with ACID transactions and scalable metadata handling.
A command-line tool for running SQL queries against JSON, CSV, Excel, Parquet, and other structured data files.
A blazing-fast command-line toolkit for querying, slicing, analyzing, transforming, and validating tabular data (CSV, Excel, JSONL, etc.).
A lightweight TUI application for viewing and querying tabular data files like CSV, Parquet, and JSON with SQL support.
A graph database framework for storing and querying large-scale graphs with rich properties and in-database aggregation.
An embedded database for serverless and edge runtimes, storing data as Parquet on S3 with stateless compute.
A genomics analysis platform that uses Apache Spark to parallelize genomic data processing across clusters, replacing traditional file-based workflows.
A simple, fast, and flexible ETL framework for .NET with built-in readers and writers for CSV, JSON, XML, Parquet, and more.
Global open dataset of aggregated fixed and mobile network performance metrics (download/upload/latency) in geospatial tiles.
A Go library that generates type-safe Parquet readers and writers from Go structs or existing Parquet files.
A Spark application for migrating data to ScyllaDB from CQL-compatible databases or DynamoDB via Alternator.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.