Showing 12 of 12 projects
A real-time, no-code ORM that provides APIs and documentation automatically, allowing frontend clients to customize JSON responses.
A comprehensive JVM-based deep learning ecosystem for building, training, and deploying models with support for model import and distributed training.
A high-performance distributed POSIX file system for cloud-native environments, storing data in object storage and metadata in databases.
A fast distributed SQL query engine for big data analytics, enabling interactive queries across diverse data sources.
An open source machine learning server for developers and data scientists, supporting event collection, algorithm deployment, and REST API queries.
A distributed caching platform that bridges computation frameworks and storage systems for large-scale analytics and ML workloads.
Azkaban is a batch workflow job scheduler created at LinkedIn to manage Hadoop jobs.
A Scala API for Cascading that simplifies writing Hadoop MapReduce jobs with Scala integration.
A distributed, multi-tenant gateway providing serverless SQL on data warehouses and lakehouses.
Native integration library for using Elasticsearch with Hadoop, Spark, and Hive for real-time search and analytics on big data.
A graph database framework for storing and querying large-scale graphs with rich properties and in-database aggregation.
A federated Big Data orchestration service that simplifies job execution across distributed clusters by abstracting infrastructure complexity.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.