Showing 24 of 24 projects
An all-in-one open-source platform for product analytics, feature flags, session replay, experiments, and more to help build successful products.
Open-source data integration platform for building ELT pipelines from APIs, databases, and files to data warehouses, lakes, and lakehouses.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A curated list of awesome big data frameworks, resources, and tools across various categories.
A high-performance real-time analytics database designed for fast queries and ingest to reduce time to insight.
A transformation tool that enables data analysts and engineers to transform data using software engineering best practices.
A transformation workflow that enables data teams to transform data in their warehouse using SQL and software engineering best practices.
An open-source enterprise data warehouse built in Rust for AI agents, analytics, vector search, and full-text search.
A curated list of data engineering tools, frameworks, databases, and resources for software developers.
An open-source, privacy-focused customer data platform (CDP) that collects, processes, and routes event data to warehouses and tools.
A collection of utilities, scripts, and views for managing, optimizing, and automating Amazon Redshift data warehouse operations.
A distributed, multi-tenant gateway providing serverless SQL on data warehouses and lakehouses.
An open-source Reverse ETL platform for syncing data from warehouses to business tools like Salesforce, HubSpot, and Slack.
A collection of utilities, scripts, UDFs, and dashboards for BigQuery migration, optimization, and data warehouse operations.
An advanced open-source MPP database for data warehousing, large-scale analytics, and AI/ML workloads.
A simple, fast, and flexible ETL framework for .NET with built-in readers and writers for CSV, JSON, XML, Parquet, and more.
A data API framework that turns SQL into secure RESTful APIs for AI agents and data applications.
A large-scale data warehouse system that provides approximate query answers with error bounds on massive datasets up to 300x faster than Hive.
An AWS Lambda function that automatically loads files from S3 into Amazon Redshift clusters with zero server administration.
A Python CLI tool for comparing data across heterogeneous databases and data warehouses to ensure migration accuracy.
A fast, scalable data warehouse that caches and provides advanced querying for Puppet infrastructure data.
A web client for SQL-like query engines including Hive, Presto, and BigQuery, written in Node.js.
A web-based tool for monitoring and managing Amazon Redshift clusters, providing insights into queries, WLM queues, tables, and load errors.
Terraform module for provisioning and managing AWS Redshift clusters and related resources.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.