Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.


Big Data

219 projects

Showing 3 of 219 projects

Hive_test (Java)

A unit test framework for Hive scripts that provides an embedded Hive environment with Derby database and HiveThriftService.

#unit-testing #apache-hive #java
Stars: 64
Forks: 47
Last commit: 4 years ago

emr-sample-apps (Java)

Code samples demonstrating how to use popular applications on Amazon Elastic MapReduce (EMR).

#mapreduce #educational #code-samples
Stars: 63
Forks: 51
Last commit: 10 years ago

Map/Reduce implementations of common ML algorithms (Jupyter Notebook)

Jupyter notebooks for hands-on Big Data Analytics exercise classes covering Spark ML, Map/Reduce algorithms, and deep learning.

#spark #educational #data-science
Stars: 62
Forks: 28
Last commit
Page 7 of 7

Related Tags

4 years ago
#Apache Spark (59)
#Data Processing (58)
#Distributed Computing (50)
#Hadoop (41)
#Spark (40)
#Machine Learning (39)
#Scala (37)
#Distributed Systems (32)
#Data Science (29)
#Data Engineering (29)
#Java (29)
#Stream Processing (27)