Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Data Science

Data Science

506 projects

Showing 36 of 506 projects

JupyterLab Desktop
JupyterLab DesktopTypeScript

A cross-platform desktop application for JupyterLab, providing the easiest way to run Jupyter notebooks locally.

#desktop-application#notebook#data-science
Stars4.2k
Forks472
Last commit4 days ago
pattern_classification
pattern_classificationJupyter Notebook

A comprehensive collection of tutorials, examples, and resources for understanding and solving machine learning and pattern classification problems.

#educational-resources#data-science#machine-learning-algorithms
Stars4.2k
Forks1.3k
Last commit
CML
CMLJavaScript

An open-source CLI tool for implementing CI/CD workflows with a focus on MLOps, automating ML experiments and reporting.

#developer-tools#devops#cicd
Stars4.2k
Forks345
Last commit1 year ago
Data Science Specialization
Data Science SpecializationHTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
"most important thing in data science is the question"
"most important thing in data science is the question"HTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics
Stars4.1k
Forks31.0k
Last commit5 years ago
aws-data-wrangler
aws-data-wranglerPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#redshift
Stars4.1k
Forks732
Last commit2 days ago
aws-sdk-pandas
aws-sdk-pandasPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#glue-catalog
Stars4.1k
Forks732
Last commit2 days ago
Network Analysis
Network AnalysisR

A curated list of resources for constructing, analyzing, and visualizing network data across various disciplines.

#semantic-networks#data-science#complex-networks
Stars4.1k
Forks630
Last commit
Awesome Machine Learning Interpretability
Awesome Machine Learning Interpretability

A curated list of practical resources for responsible machine learning, covering interpretability, governance, safety, and ethics.

#ai-safety#xai#model-auditing
Stars4.0k
Forks629
Last commit5 days ago
deepchecks
deepchecksPython

An open-source solution for continuous validation of machine learning models and data, from research to production.

#data-testing#ml-validation#python-library
Stars4.0k
Forks299
Last commit5 months ago
Hydrogen
HydrogenTypeScript

Run code interactively, inspect data, and plot using Jupyter kernels directly inside the Atom text editor.

#data-science#atom#repl
Stars4.0k
Forks337
Last commit3 months ago
Introduction to machine learning with scikit-learn
Introduction to machine learning with scikit-learnJupyter Notebook

A collection of Jupyter notebooks accompanying a 10-part video series teaching machine learning with Python's scikit-learn library.

#video-tutorials#educational#ml-workflow
Stars3.8k
Forks2.5k
Last commit
Tablesaw
TablesawJava

A Java dataframe and visualization library for data loading, cleaning, transformation, and analysis.

#statistical-analysis#chart#data-science
Stars3.8k
Forks649
Last commit3 months ago
RISE
RISEJavaScript

A Jupyter/IPython extension that transforms notebooks into interactive Reveal.js slideshows with live execution.

#notebook-tools#reveal-js#presentation-tool
Stars3.7k
Forks410
Last commit2 years ago
Machine Learning For Hackers <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
Machine Learning For Hackers <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

R code examples from the 'Machine Learning for Hackers' book, demonstrating practical machine learning techniques.

#code-examples#statistical-analysis#practical-ml
Stars3.7k
Forks2.2k
Last commit
Leafmap
LeafmapPython

A Python package for interactive mapping and geospatial analysis with minimal coding in Jupyter notebooks.

#open-source-gis#whiteboxtools#dataviz
Stars3.7k
Forks461
Last commit14 days ago
polyaxon
polyaxon

An open-source platform for building, training, and monitoring large-scale deep learning applications with full lifecycle MLOps.

#distributed-training#hyperparameter-tuning#workflow-orchestration
Stars3.7k
Forks325
Last commit10 days ago
ggplot
ggplotPython

A Python implementation of the grammar of graphics for creating statistical visualizations.

#matplotlib#statistical graphics#data-science
Stars3.7k
Forks563
Last commit3 years ago
chartify
chartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

#data-science#python#plotting
Stars3.6k
Forks329
Last commit1 year ago
Chartify
ChartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

#data-science#python#plotting
Stars3.6k
Forks329
Last commit1 year ago
Ploomber
PloomberPython

The fastest way to build data pipelines with iterative development and deployment anywhere.

#deployment#pipelines#airflow
Stars3.6k
Forks241
Last commit1 year ago
A Jupyter Notebook Blogging Platform Powered by GitHub Actions, Pages and Jekyll
A Jupyter Notebook Blogging Platform Powered by GitHub Actions, Pages and JekyllJupyter Notebook

An easy-to-use blogging platform with enhanced support for Jupyter Notebooks, Word docs, and Markdown, powered by GitHub Actions.

#actions#jekyll#data-science
Stars3.5k
Forks728
Last commit
ML Workspace
ML WorkspaceJupyter Notebook

A web-based IDE for machine learning and data science with pre-installed libraries and tools, deployable via Docker.

#jupyter-lab#data-science#deep-learning
Stars3.5k
Forks458
Last commit1 year ago
TensorWatch
TensorWatchJupyter Notebook

A debugging and visualization tool for data science, deep learning, and reinforcement learning in Jupyter Notebook.

#ai#data-science#deep-learning
Stars3.5k
Forks360
Last commit2 months ago
Data Science
Data Science

A curated list of Python software for data science, covering machine learning, deep learning, visualization, and data manipulation.

#data-science#deep-learning#awesome-list
Stars3.5k
Forks447
Last commit1 month ago
Ethen's Notebook Collection
Ethen's Notebook CollectionHTML

A comprehensive collection of machine learning tutorials and implementations in Python, covering algorithms from scratch to production deployment.

#python-tutorials#data-science#deep-learning
Stars3.4k
Forks668
Last commit
Koalas
KoalasPython

Koalas provides the pandas DataFrame API on Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

#apache-spark#spark#mlflow
Stars3.4k
Forks371
Last commit2 years ago
xeus-cling
xeus-clingC++

A Jupyter kernel for C++ that enables interactive computing with cling interpreter and xeus protocol.

#xeus#jupyter-kernel#notebook
Stars3.3k
Forks317
Last commit7 months ago
pgmpy
pgmpyPython

A Python toolkit for causal and probabilistic reasoning using graphical models like Bayesian Networks and Structural Equation Models.

#causal-identification#bayesian-networks#simulation
Stars3.3k
Forks1.1k
Last commit1 day ago
mljar-supervised
mljar-supervisedPython

An Automated Machine Learning Python package for tabular data with feature engineering, hyperparameter tuning, explanations, and automatic documentation.

#fairness-ml#ensemble-learning#random-forest
Stars3.3k
Forks440
Last commit
NLP Roadmap
NLP Roadmap

A visual roadmap and keyword mind map for students learning Natural Language Processing, from basics to SOTA models.

#roadmap#keyword#data-science
Stars3.3k
Forks515
Last commit6 years ago
shapash
shapashJupyter Notebook

A Python library that makes machine learning models interpretable and transparent through user-friendly visualizations and a web application.

#shap#python-library#explainability
Stars3.2k
Forks387
Last commit12 days ago
.NET Interactive
.NET InteractiveC#

An engine and API for running .NET and other languages interactively in notebooks, REPLs, and embedded coding experiences.

#multi-language#embedded-scripting#data-science
Stars3.2k
Forks459
Last commit1 month ago
tslearn
tslearnPython

A Python machine learning toolkit for time series analysis with scikit-learn compatible API.

#python-library#data-science#machine-learning-algorithms
Stars3.2k
Forks371
Last commit5 days ago
SweetViz
SweetVizPython

A Python library for automated exploratory data analysis (EDA) with high-density visualizations and target analysis in two lines of code.

#statistical-analysis#data-science#automated-reporting
Stars3.1k
Forks287
Last commit1 month ago
xLearn
xLearnC++

A high-performance, easy-to-use, and scalable machine learning package for linear models, factorization machines, and field-aware factorization machines.

#ffm#high-performance#python-library
Stars3.1k
Forks515
Last commit2 years ago
PreviousPage 5 of 15

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
2 years ago
1 month ago
2 years ago
7 years ago
3 years ago
3 days ago
10 days ago
Next
#Machine Learning288
#Python245
#Deep Learning84
#Data Analysis79
#Data Visualization79
#Statistics61
#Python Library55
#Jupyter Notebook53
#R52
#Jupyter49
#Scikit Learn48
#Pandas43