Showing 11 of 47 projects
A Python library that provides a Pandas-like API on top of Apache Spark DataFrames for distributed data analysis.
A Spark library for reading and writing data between Spark SQL and MongoDB collections.
An idiomatic Clojure dataframe library that runs on Apache Spark, providing a seamless interface for data processing and machine learning.
A high-performance, type-safe DataFrame library for the JVM enabling large-scale data analysis with parallel processing capabilities.
A Clojure library providing data-frames and arrays through Python's pandas and numpy.
A Python library for blazing-fast, memory-efficient genomics data operations using DataFrames.
An open-source toolkit for analyzing web archives at scale using Apache Spark.
A Rust crate for type-conscious, tabular data manipulation with an expressive, functional interface.
A Rust DataFrame and data engineering library with PySpark/SQL-like syntax, built for business data pipelines with Microsoft stack integration.
An experimental Rust client for Apache Spark Connect, providing a DataFrame API to interact with Spark clusters.
A Julia package that reads binary and transport files from Stata, SPSS, and SAS using the ReadStat C library.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.