Showing 6 of 6 projects
A Clojure library for writing map-reduce queries that compile to Apache Pig or Cascading, enabling distributed data processing with Clojure syntax.
A visualization framework for Apache Pig workflows that combines graphical depictions with real-time execution information.
A scalable machine learning library that runs on Apache Hive, Spark, and Pig for distributed ML directly in SQL.
An open-source big data security analytics tool that analyzes network packet capture (pcap) files using Apache Pig.
A scalable malware processing and analytics platform built on Hadoop Pig for binary data extraction and analysis.
A collection of libraries for large-scale data processing in Hadoop ecosystems, including Spark, Pig, and incremental MapReduce.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.