PySpark is the Python API for Apache Spark, a unified analytics engine for large-scale data processing. It enables distributed data processing with Python. There is currently 1 open-source alternative to PySpark, with 139 GitHub stars. The most common language among these projects is Rust.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.