Showing 13 of 13 projects
A real-time, no-code ORM that provides APIs and documentation automatically, allowing frontend clients to customize JSON responses.
A fast distributed SQL query engine for big data analytics, enabling interactive queries across diverse data sources.
A no-dependency Python SQL parser, transpiler, optimizer, and engine that translates between 31+ SQL dialects.
A high-performance table format for huge analytic datasets, enabling multiple engines to safely work with the same tables simultaneously.
A distributed, multi-tenant gateway providing serverless SQL on data warehouses and lakehouses.
A Big Data IDE for discovering, creating, and sharing data analyses, queries, and tables with collocated metadata.
A library enabling MongoDB to serve as input source or output destination for Hadoop MapReduce tasks and ecosystem tools.
A framework enabling spatial data analysis within Hadoop ecosystems using Hive and SparkSQL.
An open-source unit test framework for Hive SQL queries, enabling TDD without installed dependencies via JUnit 4 and 5.
A Hadoop library for reading and processing packet capture (PCAP) files in MapReduce jobs and Hive queries.
A Go-based toolkit for fast ETL and feature extraction on Hadoop, optimized for rapid development and execution.
A web client for SQL-like query engines including Hive, Presto, and BigQuery, written in Node.js.
Mozilla's utility library for Hadoop, HBase, Pig, and related big data technologies.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.