Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
RHadoop
Apache Hivemall is a scalable machine learning library that runs on Apache Hive, Spark and Pig
BigDL is a distributed deep learning library for Apache Spark; with BigDL, users can write their deep learning applications as standard Spark programs, which can directly run on top of existing Spark or Hadoop clusters