Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Data Science

Data Science

243 projects

Showing 36 of 243 projects

Gephi Datasets
Gephi DatasetsJava

An award-winning open-source platform for visualizing and manipulating large graphs and networks.

#graph#gephi-toolkit#opengl
Stars6.5k
Forks1.6k
Last commit2 days ago
Gephi
GephiJava

An open-source platform for visualizing and manipulating large graphs and networks with real-time performance.

#graph#open-source#opengl
Stars6.5k
Forks1.6k
Last commit2 days ago
R
RR

A curated list of awesome R packages, frameworks, and software for data science and statistical computing.

#data-science#r packages#web-technologies
Stars6.4k
Forks1.5k
Last commit7 months ago
Posts
PostsR

A curated list of awesome R packages, frameworks, and software for data science and statistical computing.

#finance-analysis#data-science#r packages
Stars6.4k
Forks1.5k
Last commit7 months ago
papermill
papermillPython

A Python tool for parameterizing, executing, and analyzing Jupyter Notebooks at scale.

#julia#notebook#publishing
Stars6.4k
Forks449
Last commit18 days ago
Aim
AimPython

An open-source, self-hosted ML experiment tracker with a performant UI and SDK for comparing and querying training runs.

#ai#metadata-logging#open-source
Stars6.1k
Forks386
Last commit2 days ago
TensorFlow tutorials
TensorFlow tutorialsJupyter Notebook

A collection of simple tutorials introducing deep learning concepts using Google's TensorFlow framework.

#ai#educational#data-science
Stars6.0k
Forks1.5k
Last commit2 years ago
snorkel
snorkelPython

A Python library for programmatically building and managing training data using weak supervision.

#programmatic-labeling#weak-supervision#ai
Stars6.0k
Forks855
Last commit14 days ago
voila
voilaPython

Voilà converts Jupyter notebooks into secure, standalone web applications with interactive widgets.

#jupyterlab-extension#notebook#data-science
Stars5.9k
Forks526
Last commit2 days ago
Voila
VoilaPython

Voilà transforms Jupyter notebooks into secure, standalone web applications with interactive widgets.

#jupyterlab-extension#deployment#notebook
Stars5.9k
Forks526
Last commit2 days ago
causalml
causalmlPython

A Python package for uplift modeling and causal inference using machine learning algorithms to estimate treatment effects.

#data-science#treatment-effects#experimental-design
Stars5.8k
Forks857
Last commit1 month ago
River
RiverPython

A Python library for online machine learning, designed for streaming data with a focus on user experience.

#online-machine-learning#python-library#data-science
Stars5.8k
Forks624
Last commit2 days ago
Curated list of Python tutorials for Data Science, NLP and Machine Learning
Curated list of Python tutorials for Data Science, NLP and Machine LearningPython

A curated collection of Python tutorials and resources for data science, machine learning, and natural language processing.

#python-tutorials#educational-resources#data-science
Stars5.8k
Forks1.5k
Last commit
shiny <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
shiny <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

An R package for building interactive web applications without requiring HTML, CSS, or JavaScript knowledge.

#web-app#r-package#data-science
Stars5.6k
Forks1.9k
Last commit
mlpack
mlpackC++

A fast, header-only C++ machine learning library with bindings for Python, R, Julia, and Go.

#hacktoberfest#scientific-computing#machine-learning-library
Stars5.6k
Forks1.7k
Last commit9 days ago
lux
luxPython

A Python library that automates data visualization and exploration for pandas dataframes in Jupyter notebooks.

#data-science#altair#interactive-widget
Stars5.4k
Forks380
Last commit2 years ago
zenml
zenmlPython

An open-source MLOps platform for building, orchestrating, and deploying production AI pipelines and agents.

#pipelines#ai-pipelines#data-science
Stars5.4k
Forks607
Last commit2 days ago
SQLFlow
SQLFlowGo

A compiler that extends SQL with AI capabilities to train, predict, and evaluate machine learning models directly from SQL statements.

#ai#argo-workflows#sql-ai
Stars5.2k
Forks705
Last commit2 years ago
cuML
cuMLC++

A suite of GPU-accelerated machine learning algorithms with scikit-learn compatible APIs for 10-50x faster performance on large datasets.

#cuda#data-science#nvidia
Stars5.2k
Forks622
Last commit1 day ago
MLxtend
MLxtendPython

A Python library providing extensions and utilities for data science and machine learning tasks.

#ensemble-learning#scientific-computing#feature-selection
Stars5.1k
Forks902
Last commit3 months ago
machine-learning-book
machine-learning-bookJupyter Notebook

Code repository for the 'Machine Learning with PyTorch and Scikit-Learn' book, providing practical examples and notebooks.

#code-examples#data-science#deep-learning
Stars5.1k
Forks1.8k
Last commit3 months ago
geopandas
geopandasPython

A Python library that extends pandas to work with geographic data, enabling spatial operations and analysis.

#open-source-gis#pandas-extension#data-science
Stars5.1k
Forks1.0k
Last commit11 days ago
R for Data Science, 2E
R for Data Science, 2ER

An open-source book teaching data science using R, covering data import, transformation, visualization, and modeling.

#bookdown#data-science#statistics
Stars5.0k
Forks4.4k
Last commit16 days ago
dplyr <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">
dplyr <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

A grammar of data manipulation for R, providing a consistent set of verbs to solve common data manipulation challenges.

#r-package#data-science#data-wrangling
Stars5.0k
Forks2.1k
Last commit
R Studio
R StudioJava

An integrated development environment (IDE) for the R programming language with a comprehensive workbench and server capabilities.

#data-science#r-language#research-tools
Stars5.0k
Forks1.2k
Last commit2 days ago
matplotplusplus
matplotplusplusC++

A C++ graphics library for data visualization with interactive plotting, high-quality export, and dozens of plot categories.

#scientific-visualization#scientific-computing#graphics
Stars4.9k
Forks376
Last commit22 days ago
datascience
datascience

A curated collection of Python libraries, tutorials, and tools for data science, from data wrangling to machine learning and visualization.

#data-science#statistics#deep-learning
Stars4.6k
Forks708
Last commit21 days ago
Awesome Jupyter
Awesome Jupyter

A curated list of awesome Jupyter projects, libraries, and resources for data science and interactive computing.

#jupyterlab-extension#notebook-tools#jupyterhub
Stars4.6k
Forks454
Last commit2 days ago
markusschanta/awesome-jupyter, "Hosted Notebook Solutions"
markusschanta/awesome-jupyter, "Hosted Notebook Solutions"

A curated list of awesome Jupyter projects, libraries, and resources for data science and interactive computing.

#jupyterlab-extension#notebook-tools#jupyterhub
Stars4.6k
Forks454
Last commit2 days ago
TensorFlow Book
TensorFlow BookJupyter Notebook

Official code repository for the 'Machine Learning with TensorFlow' book with practical examples.

#code-examples#autoencoder#data-science
Stars4.4k
Forks1.2k
Last commit3 years ago
Oryx
OryxJupyter Notebook

A library for probabilistic reasoning and statistical analysis integrated with TensorFlow and JAX.

#statistical-analysis#variational-inference#jax
Stars4.4k
Forks1.1k
Last commit8 days ago
A curated list of awesome data visualization libraries and resources.
A curated list of awesome data visualization libraries and resources.

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

#developer-tools#open-source#data-science
Stars4.3k
Forks455
Last commit2 years ago
Data Visualization
Data Visualization

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

#chart#developer-tools#open-source
Stars4.3k
Forks455
Last commit2 years ago
Data Visualization
Data Visualization

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

#chart#developer-tools#open-source
Stars4.3k
Forks455
Last commit2 years ago
Mercury
MercuryPython

A framework for building interactive web applications directly from Python notebooks, including chats, AI agents, dashboards, and reports.

#jupyterlab-extension#jupyter-lab#data-science
Stars4.3k
Forks280
Last commit7 days ago
pattern_classification
pattern_classificationJupyter Notebook

A comprehensive collection of tutorials, examples, and resources for understanding and solving machine learning and pattern classification problems.

#educational-resources#data-science#machine-learning-algorithms
Stars4.2k
Forks1.3k
Last commit
PreviousPage 4 of 7

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
2 years ago
2 days ago
17 days ago
2 years ago
Next
#Machine Learning154
#Python151
#Deep Learning58
#Data Visualization50
#Python Library36
#Data Analysis36
#Jupyter Notebook32
#Scikit Learn32
#Statistics30
#Jupyter Notebooks27
#Jupyter26
#R24