Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Stacks
  3. scikit-learn
S

scikit-learn

Framework
122 projects403.4k total stars103.1k total forks7 languages

Open-source projects built with scikit-learn

There are currently 122 open-source projects built with scikit-learn, with a combined total of 403.4k GitHub stars. The most common language among these projects is Python.

Showing 119 open-source projects · page 3 of 4

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
SUAVE
SUAVEsuavecode/SUAVE

A multi-fidelity conceptual design environment for modeling future aircraft with advanced technologies.

514450ReScript
2 years ago
sklearn-expertsys
sklearn-expertsystmadl/sklearn-expertsys

A scikit-learn compatible classifier that produces human-interpretable decision rules instead of black box models.

49071Python
8 years ago
CellTypist
CellTypistTeichlab/celltypist

An automated cell type annotation tool for single-cell RNA-seq data using logistic regression classifiers.

48759Python
15 days ago
open-solution-home-credit
open-solution-home-creditneptune-ml/open-solution-home-credit

An open-source machine learning solution for the Home Credit Default Risk Kaggle competition, providing reproducible code and experiments.

464171Python
4 years ago
Suite2p
Suite2pMouseLand/suite2p

A complete pipeline for processing two-photon calcium imaging data, including registration, ROI detection, signal extraction, and spike deconvolution.

452274Python
5 days ago
RuleFit
RuleFitchristophM/rulefit

Python implementation of the RuleFit algorithm for interpretable machine learning predictions using rule ensembles.

446120Python
2 years ago
imbalanced-ensemble
imbalanced-ensembleZhiningLiu1998/imbalanced-ensemble

A Python library for class-imbalanced ensemble learning with 30+ algorithms, built on scikit-learn.

42660Python
3 months ago
scikit-rebate
scikit-rebateEpistasisLab/scikit-rebate

A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for machine learning.

42172Python
3 years ago
PyImSegm
PyImSegmBorda/pyImSegm

A Python toolbox for image segmentation featuring superpixel segmentation, object center detection, and region growing with shape priors.

37674Python
4 years ago
FairML
FairMLadebayoj/fairml

A Python toolbox for auditing machine learning models to detect and quantify bias in black-box predictions.

36875Python
5 years ago
LSTM-Crypto-Price-Prediction
LSTM-Crypto-Price-PredictionSC4RECOIN/LSTM-Crypto-Price-Prediction

Predicts Bitcoin price trends using an LSTM-RNN with technical indicators for automated trading via the Binance API.

36356Python
4 years ago
scBERT
scBERTTencentAILabHealthcare/scBERT

A BERT-based foundation model pretrained on large-scale scRNA-seq data for automated cell type annotation in single-cell analysis.

35769Python
2 years ago
End-To-End Memory Networks
End-To-End Memory Networksdomluna/memn2n

A TensorFlow implementation of End-To-End Memory Networks with a scikit-learn-like interface for bAbI tasks.

340131Python
9 years ago
skpro
skproalan-turing-institute/skpro

A scikit-learn compatible Python library for probabilistic regression, survival analysis, and probability distributions.

327188Python
4 days ago
Deep stacked residual bidirectional LSTMs for HAR
Deep stacked residual bidirectional LSTMs for HARguillaume-chevalier/HAR-stacked-residual-bidir-LSTMs

A deep learning architecture using stacked residual bidirectional LSTM cells with TensorFlow for human activity recognition from sensor data.

32397Python
3 years ago
DeepDTA
DeepDTAhkmztrk/DeepDTA

Deep learning model using convolutional neural networks to predict drug-target binding affinity from protein sequences and compound SMILES.

301117Python
2 years ago
mol2vec
mol2vecsamoturk/mol2vec

An unsupervised machine learning approach to learn vector representations of molecular substructures for cheminformatics.

290121Python
3 years ago
DescriptaStorus
DescriptaStorusbp-kelley/descriptastorus

A Python library for fast random access to chemical descriptors and molecule indices, optimized for machine learning workflows.

28067Python
1 year ago
Geosnap
Geosnapspatialucr/geosnap

A Python package for exploring, modeling, and visualizing neighborhood and regional change over time using geospatial data.

27332Python
3 months ago
PyODDS
PyODDSdatamllab/pyodds

An end-to-end Python outlier detection system with database support, automated machine learning, and unified APIs for statistical, ML, and deep learning models.

25539Python
3 years ago
scikit-rvm
scikit-rvmJamesRitchie/scikit-rvm

A scikit-learn compatible Python implementation of the Relevance Vector Machine for sparse Bayesian learning.

23775Python
9 months ago
CropHarvest
CropHarvestnasaharvest/cropharvest

An open-source remote sensing dataset and pipeline for agricultural land use classification, featuring 95,186 datapoints with satellite and climatology data.

23456Jupyter Notebook
2 years ago
BoostARoota
BoostARootachasedehan/BoostARoota

A fast feature selection algorithm for tree-based models like XGBoost, designed to outperform Boruta in speed and performance.

23336Python
5 years ago
Stacking
Stackingikki407/stacking

A Python library for stacked generalization (ensemble learning) that supports scikit-learn, XGBoost, and Keras models with out-of-fold prediction saving.

23075Python
8 years ago
Rosetta
Rosettacolumbia-applied-data-science/rosetta

A Python toolkit for text-focused data science on medium-sized datasets, bridging memory and cluster-scale processing.

20745Jupyter Notebook
3 years ago
visualize_ML
visualize_MLayush1997/visualize_ML

A Python package for automated univariate and bivariate data analysis and visualization to streamline machine learning workflows.

20529Python
9 years ago
ChemML
ChemMLhachmannlab/chemml

A Python machine learning and informatics suite for analyzing, mining, and modeling chemical and materials data.

17633Python
1 month ago
CellProfiler Analyst
CellProfiler AnalystCellProfiler/CellProfiler-Analyst

Interactive exploration and analysis software for large, high-dimensional image-derived biological data with supervised machine learning.

17075Python
10 months ago
RiskInDroid
RiskInDroidClaudiuGeorgiu/RiskInDroid

A machine learning tool for quantitative risk analysis of Android apps by analyzing declared and actual permission usage.

16231Python
13 days ago
DH3D
DH3DJuanDuGit/DH3D

A deep learning approach that unifies global place recognition and local 6DoF pose refinement for robust relocalization in large-scale 3D point clouds.

15817Python
5 years ago
DrugBAN
DrugBANpeizhenbai/DrugBAN

A deep bilinear attention network framework with adversarial domain adaptation for interpretable drug-target interaction prediction.

14818Python
3 years ago
topicwizard
topicwizardx-tabdeveloping/topic-wizard

Interactive topic model visualization and interpretation library for Python, compatible with sklearn, Gensim, BERTopic, and Turftopic.

14817Python
1 year ago
Deep Belief Nets for Topic Modeling
Deep Belief Nets for Topic Modelinglarsmaaloee/deep-belief-nets-for-topic-modeling

A Python toolbox using deep belief networks for topic modeling on document data, producing latent representations for content-based recommendation.

14456Python
11 years ago
Dreamento
Dreamentodreamento/dreamento

An open-source Python toolbox for real-time EEG monitoring, analysis, and sensory stimulation during sleep for dream engineering research.

14212Python
2 years ago
Pycytominer
Pycytominercytomining/pycytominer

A Python package for processing and normalizing high-dimensional morphological feature data from high-throughput cell imaging experiments.

14240Python
2 days ago
windML
windMLcigroup-ol/windml

A Python framework for accessing wind data sources and performing renewable energy forecasting and prediction.

13042
2 years ago
1
2
3
4