A distributed storage system for object storage (S3), file systems, and Iceberg tables, optimized for billions of files with O(1) disk access.
A high-performance distributed POSIX file system for cloud-native environments, storing data in object storage and metadata in databases.
An embeddable C++ storage engine for dense and sparse multi-dimensional arrays, dataframes, and key-value stores.
A native Go client library and command-line tool for HDFS that connects directly to the namenode via protocol buffers.
LinkedIn's previous-generation Kafka-to-HDFS pipeline for batch data ingestion.
A pure Python HDFS client and Hadoop minicluster wrapper for interacting with the Hadoop Distributed File System.
A collection of R packages for interacting with the Hadoop ecosystem, enabling big data analysis from R.
An interactive visualization tool for monitoring Hadoop HDFS cluster usage and file storage efficiency.
Runs MPI programs on Hadoop YARN clusters using MPICH-3.1.2 and SSH for distributed computing.