An experimental Rust client for Apache Spark Connect, providing a DataFrame API to interact with Spark clusters.
An open-source framework for developing large-scale anomaly detection models using Apache Spark.
A PMML evaluator library for Apache Spark that provides ML-compatible transformers for deploying predictive models.
A Docker container providing a complete streaming environment for experimenting with Kafka, Spark Streaming, and Cassandra.
A collection of interactive Jupyter notebooks for learning Hadoop, Spark, and MapReduce with hands-on tutorials and demos.
A Spark application for migrating data to ScyllaDB from CQL-compatible databases or DynamoDB via Alternator.