Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Clustering

Clustering

65 projects

Showing 29 of 65 projects

GitHub repository
GitHub repositoryPython

A PyTorch framework for semantic segmentation of large 3D point clouds using superpoint graphs.

#lidar#graph-neural-networks#3d-computer-vision
Stars803
Forks215
Last commit2 years ago
ggfortify
ggfortifyR

Unified ggplot2 interface for visualizing statistical results from popular R packages.

#statistical-analysis#r-package#survival-analysis
Stars539
Forks68
Last commit7 months ago
aembke/fred
aembke/fredRust

An asynchronous Rust client for Valkey and Redis with support for RESP2/RESP3, clustering, TLS, and advanced features.

#database-driver#async-rust#redis-client
Stars525
Forks80
Last commit1 year ago
streamDM
streamDMScala

A Spark Streaming library for mining big data streams with incremental learning algorithms.

#classification#stream-mining#data-streams
Stars497
Forks141
Last commit3 years ago
cdlib
cdlibPython

A Python meta-library for community detection in complex networks, implementing algorithms, fitness functions, and visualization.

#networkx#igraph#python-library
Stars425
Forks77
Last commit5 months ago
heatmaply
heatmaplyR

An R package for creating interactive, cluster-based heatmaps using plotly for online publishing and data exploration.

#dendrogram#r-package#statistical graphics
Stars406
Forks75
Last commit6 months ago
trident-ml
trident-mlJava

A real-time online machine learning library built on Apache Storm for scalable stream processing with incremental algorithms.

#stream-processing#java-library#text-classification
Stars384
Forks85
Last commit2 years ago
Malheur
MalheurC

A tool for automatic analysis of malware behavior using machine learning to identify, cluster, and classify malicious software.

#sandbox-analysis#classification#malware-analysis
Stars374
Forks102
Last commit7 years ago
Clustering
ClusteringJulia

A Julia package providing comprehensive clustering algorithms and validation metrics for data analysis.

#julia#k-means#data-science
Stars373
Forks123
Last commit6 months ago
Goakt
GoaktGo

A distributed actor framework for Go that enables building scalable, reactive systems with typed messages and clustering.

#distributed#observability#distributed-systems
Stars355
Forks35
Last commit5 days ago
corrplot
corrplotR

An R package for visualizing correlation matrices with automatic variable reordering to reveal hidden patterns.

#scientific-visualization#correlation-analysis#r-package
Stars346
Forks90
Last commit1 year ago
Jubilee
JubileeRuby

A high-performance Rack-compatible HTTP server built on Vert.x for JRuby applications.

#event-driven#http-server#jruby
Stars326
Forks19
Last commit9 years ago
clml
clmlCommon Lisp

A high-performance, large-scale statistical machine learning library written in Common Lisp.

#statistical-analysis#dimensionality-reduction#classification
Stars267
Forks35
Last commit4 years ago
scicloj.ml
scicloj.mlClojure

An idiomatic Clojure machine learning library providing a unified interface for classification, regression, and unsupervised models.

#metamorph#tech-ml-dataset#hyperparameter-optimization
Stars238
Forks16
Last commit7 months ago
bioyino
bioyinoRust

A high-performance, high-precision multithreaded StatsD server written in Rust with clustering and fault tolerance.

#high-performance#metrics-collection#statsd-server
Stars236
Forks20
Last commit2 years ago
Rafty
RaftyC#

A .NET Core implementation of the Raft consensus algorithm, designed for building distributed systems without external dependencies.

#raft-algorithm#raft-consensus#raft-server
Stars200
Forks28
Last commit10 months ago
pup
pupTypeScript

A universal process manager built in Deno to keep scripts, applications, and services alive across platforms.

#process-manager#rest-api#deno
Stars194
Forks6
Last commit3 months ago
elixir-on-docker
elixir-on-dockerElixir

A Docker-based template for quickly developing clustered Elixir applications with cloud-native defaults.

#elixir#web-server#development-tool
Stars175
Forks10
Last commit6 years ago
MLKit
MLKitSwift

A simple machine learning framework written in Swift, currently focusing on regression algorithms.

#genetic-algorithms#machine-learning-library#data-science
Stars153
Forks14
Last commit7 years ago
clj-ml
clj-mlClojure

A machine learning library for Clojure built on top of Weka, providing filters, classifiers, regression, and clustering algorithms.

#data-science#classification#weka
Stars134
Forks20
Last commit4 years ago
d3-scale-cluster
d3-scale-clusterJavaScript

A D3 scale that clusters continuous data into discrete groups using a 1D clustering algorithm, similar to quantile scales.

#web-worker#ckmeans#choropleth
Stars132
Forks10
Last commit3 years ago
unitdb
unitdbGo

A blazing fast specialized time-series database optimized for IoT, real-time connected devices, and AI analytics.

#iot#database#high-performance
Stars124
Forks11
Last commit3 years ago
ganitha
ganithaScala

A Scalding library for machine learning and statistical analysis, featuring Mahout vector integration, K-Means clustering, and Naive-Bayes classifiers.

#statistical-analysis#classification#scalding
Stars109
Forks12
Last commit11 years ago
kmeans-clusterer
kmeans-clustererRuby

A Ruby implementation of k-means clustering with k-means++ initialization, silhouette scoring, and multiple runs for optimal results.

#k-means#kmeans-clustering#narray
Stars99
Forks17
Last commit5 years ago
unsup
unsupLua

A Torch package providing unsupervised learning modules and algorithms like autoencoders, PCA, and k-means.

#autoencoder#k-means#dimensionality-reduction
Stars86
Forks36
Last commit9 years ago
Multilingual Latent Dirichlet Allocation LDA
Multilingual Latent Dirichlet Allocation LDAPython

A Python pipeline for multilingual text clustering using Latent Dirichlet Allocation with stop words removal, n-gram features, and inverse stemming.

#n-grams#stemming#multilingual
Stars83
Forks29
Last commit
go-pr
go-prGo

A pattern recognition library for Go providing classification, clustering, and feature extraction algorithms.

#classification#go-library#pattern-recognition
Stars68
Forks14
Last commit13 years ago
Clatern
ClaternClojure

A Clojure library providing machine learning algorithms with simple APIs for data preprocessing and modeling.

#random-forest#csv-io#classification
Stars67
Forks12
Last commit10 years ago
weka
wekaRuby

A JRuby gem providing Ruby interfaces for Weka's machine learning and data mining algorithms.

#jruby#weka-wrapper#data-science
Stars65
Forks8
Last commit5 months ago
PreviousPage 2 of 2

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
1 year ago
#Machine Learning33
#Classification20
#Data Science16
#Regression13
#Python9
#Time Series8
#Real Time6
#Distributed Systems6
#Unsupervised Learning6
#Data Visualization6
#High Performance5
#Data Mining5