Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Data Science

Data Science

243 projects

Showing 36 of 252 projects

JupyterLab Desktop
JupyterLab DesktopTypeScript

A cross-platform desktop application for JupyterLab, providing the easiest way to run Jupyter notebooks locally.

#desktop-application#notebook#data-science
Stars4.2k
Forks464
Last commit1 month ago
CML
CMLJavaScript

An open-source CLI tool for implementing CI/CD workflows with a focus on MLOps, automating ML experiments and reporting.

#developer-tools#devops#cicd
Stars4.2k
Forks344
Last commit10 months ago
Data Science Specialization
Data Science SpecializationHTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
"most important thing in data science is the question"
"most important thing in data science is the question"HTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
aws-sdk-pandas
aws-sdk-pandasPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#glue-catalog
Stars4.1k
Forks725
Last commit3 days ago
aws-data-wrangler
aws-data-wranglerPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#redshift
Stars4.1k
Forks725
Last commit3 days ago
Hydrogen
HydrogenTypeScript

Run code interactively, inspect data, and plot using Jupyter kernels directly inside the Atom text editor.

#data-science#atom#repl
Stars4.0k
Forks337
Last commit1 month ago
Network Analysis
Network AnalysisR

A curated list of resources for constructing, analyzing, and visualizing network data across various disciplines.

#semantic-networks#data-science#complex-networks
Stars4.0k
Forks628
Last commit
Awesome Machine Learning Interpretability
Awesome Machine Learning Interpretability

A curated list of practical resources for responsible machine learning, covering interpretability, governance, safety, and ethics.

#ai-safety#xai#model-auditing
Stars4.0k
Forks626
Last commit1 month ago
deepchecks
deepchecksPython

An open-source solution for continuous validation of machine learning models and data, from research to production.

#data-testing#ml-validation#python-library
Stars4.0k
Forks293
Last commit3 months ago
Introduction to machine learning with scikit-learn
Introduction to machine learning with scikit-learnJupyter Notebook

A collection of Jupyter notebooks accompanying a 10-part video series teaching machine learning with Python's scikit-learn library.

#video-tutorials#educational#ml-workflow
Stars3.8k
Forks2.5k
Last commit
Machine Learning For Hackers <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
Machine Learning For Hackers <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

R code examples from the 'Machine Learning for Hackers' book, demonstrating practical machine learning techniques.

#code-examples#statistical-analysis#practical-ml
Stars3.8k
Forks2.2k
Last commit
Tablesaw
TablesawJava

A Java dataframe and visualization library for data loading, cleaning, transformation, and analysis.

#statistical-analysis#chart#data-science
Stars3.7k
Forks651
Last commit1 month ago
RISE
RISEJavaScript

A Jupyter/IPython extension that transforms notebooks into interactive Reveal.js slideshows with live execution.

#notebook-tools#reveal-js#presentation-tool
Stars3.7k
Forks413
Last commit2 years ago
polyaxon
polyaxon

An open-source platform for building, training, and monitoring large-scale deep learning applications with full lifecycle MLOps.

#distributed-training#hyperparameter-tuning#workflow-orchestration
Stars3.7k
Forks324
Last commit5 days ago
ggplot
ggplotPython

A Python implementation of the grammar of graphics for creating statistical visualizations.

#matplotlib#statistical graphics#data-science
Stars3.7k
Forks564
Last commit3 years ago
Leafmap
LeafmapPython

A Python package for interactive mapping and geospatial analysis with minimal coding in Jupyter notebooks.

#open-source-gis#whiteboxtools#dataviz
Stars3.7k
Forks459
Last commit22 days ago
chartify
chartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

#data-science#python#plotting
Stars3.6k
Forks331
Last commit1 year ago
Chartify
ChartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

#data-science#python#plotting
Stars3.6k
Forks331
Last commit1 year ago
Ploomber
PloomberPython

The fastest way to build data pipelines with iterative development and deployment anywhere.

#deployment#pipelines#airflow
Stars3.6k
Forks241
Last commit11 months ago
A Jupyter Notebook Blogging Platform Powered by GitHub Actions, Pages and Jekyll
A Jupyter Notebook Blogging Platform Powered by GitHub Actions, Pages and JekyllJupyter Notebook

An easy-to-use blogging platform with enhanced support for Jupyter Notebooks, Word docs, and Markdown, powered by GitHub Actions.

#actions#jekyll#data-science
Stars3.5k
Forks731
Last commit
ML Workspace
ML WorkspaceJupyter Notebook

A web-based IDE for machine learning and data science with pre-installed libraries and tools, deployable via Docker.

#jupyter-lab#data-science#deep-learning
Stars3.5k
Forks459
Last commit1 year ago
TensorWatch
TensorWatchJupyter Notebook

A debugging and visualization tool for data science, deep learning, and reinforcement learning in Jupyter Notebook.

#ai#data-science#deep-learning
Stars3.5k
Forks361
Last commit25 days ago
Ethen's Notebook Collection
Ethen's Notebook CollectionHTML

A comprehensive collection of machine learning tutorials and implementations in Python, covering algorithms from scratch to production deployment.

#python-tutorials#data-science#deep-learning
Stars3.4k
Forks669
Last commit
Data Science
Data Science

A curated list of Python software for data science, covering machine learning, deep learning, visualization, and data manipulation.

#data-science#deep-learning#awesome-list
Stars3.4k
Forks436
Last commit11 days ago
Koalas
KoalasPython

Koalas provides the pandas DataFrame API on Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

#apache-spark#spark#mlflow
Stars3.4k
Forks368
Last commit2 years ago
xeus-cling
xeus-clingC++

A Jupyter kernel for C++ that enables interactive computing with cling interpreter and xeus protocol.

#xeus#jupyter-kernel#notebook
Stars3.3k
Forks316
Last commit5 months ago
NLP Roadmap
NLP Roadmap

A visual roadmap and keyword mind map for students learning Natural Language Processing, from basics to SOTA models.

#roadmap#keyword#data-science
Stars3.3k
Forks516
Last commit6 years ago
pgmpy
pgmpyPython

A Python toolkit for causal and probabilistic reasoning using graphical models like Bayesian Networks and Structural Equation Models.

#causal-identification#bayesian-networks#simulation
Stars3.3k
Forks1.1k
Last commit2 days ago
mljar-supervised
mljar-supervisedPython

An Automated Machine Learning Python package for tabular data with feature engineering, hyperparameter tuning, explanations, and automatic documentation.

#fairness-ml#ensemble-learning#random-forest
Stars3.3k
Forks433
Last commit
.NET Interactive
.NET InteractiveC#

An engine and API for running .NET and other languages interactively in notebooks, REPLs, and embedded coding experiences.

#multi-language#embedded-scripting#data-science
Stars3.2k
Forks450
Last commit2 days ago
shapash
shapashJupyter Notebook

A Python library that makes machine learning models interpretable and transparent through user-friendly visualizations and a web application.

#shap#python-library#explainability
Stars3.2k
Forks375
Last commit7 days ago
tslearn
tslearnPython

A Python machine learning toolkit for time series analysis with scikit-learn compatible API.

#python-library#data-science#machine-learning-algorithms
Stars3.1k
Forks371
Last commit2 days ago
xLearn
xLearnC++

A high-performance, easy-to-use, and scalable machine learning package for linear models, factorization machines, and field-aware factorization machines.

#ffm#high-performance#python-library
Stars3.1k
Forks515
Last commit2 years ago
SweetViz
SweetVizPython

A Python library for automated exploratory data analysis (EDA) with high-density visualizations and target analysis in two lines of code.

#statistical-analysis#data-science#automated-reporting
Stars3.1k
Forks289
Last commit13 days ago
shogun
shogunC++

A unified and efficient machine learning toolbox with C++ core and multi-language interfaces, developed since 1999.

#multi-language#research-tool#data-science
Stars3.1k
Forks1.0k
Last commit2 years ago
PreviousPage 5 of 7

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
7 days ago
2 years ago
7 years ago
3 years ago
1 month ago
29 days ago
Next
#Machine Learning154
#Python151
#Deep Learning58
#Data Visualization50
#Python Library36
#Data Analysis36
#Jupyter Notebook32
#Scikit Learn32
#Statistics30
#Jupyter Notebooks27
#Jupyter26
#R24