Data Science

#statistical-analysis#variational-inference#jax

Forks1.2k

Last commit3 years ago

OryxJupyter Notebook

A library for probabilistic reasoning and statistical analysis integrated with TensorFlow and JAX.

#chart#developer-tools#open-source

Forks1.1k

Last commit17 days ago

Data Visualization

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

Forks466

A curated list of awesome data visualization libraries and resources.

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

#chart#developer-tools#open-source

Forks466

#chart#developer-tools#open-source

Data Visualization

A curated list of awesome open-source data visualization libraries, frameworks, and resources across multiple programming languages.

Forks466

#jupyterlab-extension#jupyter-lab#data-science

MercuryPython

A framework for building interactive web applications directly from Python notebooks, including chats, AI agents, dashboards, and reports.

Stars4.3k

Forks292

Last commit15 days ago

JupyterLab DesktopTypeScript

A cross-platform desktop application for JupyterLab, providing the easiest way to run Jupyter notebooks locally.

#desktop-application#notebook#data-science

Stars4.3k

Forks475

Last commit1 day ago

pattern_classificationJupyter Notebook

A comprehensive collection of tutorials, examples, and resources for understanding and solving machine learning and pattern classification problems.

#educational-resources#data-science#machine-learning-algorithms

An open-source CLI tool for implementing CI/CD workflows with a focus on MLOps, automating ML experiments and reporting.

#developer-tools#devops#cicd

Stars4.2k

Forks345

"most important thing in data science is the question"HTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics

Stars4.2k

Forks31.0k

Last commit5 years ago

Data Science SpecializationHTML

Course materials for the Johns Hopkins Data Science Specialization on Coursera.

#coursera#data-science#statistics

Stars4.2k

Forks31.0k

Last commit5 years ago

aws-sdk-pandasPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#glue-catalog

Stars4.1k

Forks737

Last commit3 days ago

aws-data-wranglerPython

A Python library that simplifies data integration between pandas and AWS services like Athena, S3, Redshift, and more.

#apache-arrow#data-science#redshift

Stars4.1k

Forks737

Last commit3 days ago

Network AnalysisR

A curated list of resources for constructing, analyzing, and visualizing network data across various disciplines.

#semantic-networks#data-science#complex-networks

Stars4.1k

Forks637

Last commit

Awesome Machine Learning Interpretability

A curated list of practical resources for responsible machine learning, covering interpretability, governance, safety, and ethics.

#ai-safety#xai#model-auditing

Stars4.0k

Forks629

Last commit1 month ago

deepchecksPython

An open-source solution for continuous validation of machine learning models and data, from research to production.

#data-testing#ml-validation#python-library

Stars4.0k

Forks302

Last commit6 months ago

HydrogenTypeScript

Run code interactively, inspect data, and plot using Jupyter kernels directly inside the Atom text editor.

#data-science#atom#repl

Stars4.0k

Forks340

Last commit5 days ago

Introduction to machine learning with scikit-learnJupyter Notebook

A collection of Jupyter notebooks accompanying a 10-part video series teaching machine learning with Python's scikit-learn library.

#video-tutorials#educational#ml-workflow

A Java dataframe and visualization library for data loading, cleaning, transformation, and analysis.

#statistical-analysis#chart#data-science

A Jupyter/IPython extension that transforms notebooks into interactive Reveal.js slideshows with live execution.

#notebook-tools#reveal-js#presentation-tool

Forks410

Machine Learning For Hackers <img class="emoji" alt="heart" src="https://cdn.jsdelivr.net/gh/qinwf/awesome-R@3c66da6e291bcc0520b1649125b0bed750896a9a/heart.png" height="20" align="absmiddle" width="20">R

R code examples from the 'Machine Learning for Hackers' book, demonstrating practical machine learning techniques.

#code-examples#statistical-analysis#practical-ml

A Python package for interactive mapping and geospatial analysis with minimal coding in Jupyter notebooks.

#open-source-gis#whiteboxtools#dataviz

#distributed-training#hyperparameter-tuning#workflow-orchestration

Forks465

Last commit4 days ago

polyaxonMDX

An open-source platform for building, training, and monitoring large-scale deep learning applications with full lifecycle MLOps.

#matplotlib#statistical graphics#data-science

Forks329

Last commit9 days ago

ggplotPython

A Python implementation of the grammar of graphics for creating statistical visualizations.

#data-science#python#plotting

Forks562

Last commit3 years ago

chartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

Stars3.6k

Forks332

#data-science#python#plotting

ChartifyPython

A Python library that simplifies chart creation for data scientists with consistent data formats and smart defaults.

Stars3.6k

Forks332

#deployment#pipelines#airflow

PloomberPython

The fastest way to build data pipelines with iterative development and deployment anywhere.

Stars3.6k

Forks242

A Jupyter Notebook Blogging Platform Powered by GitHub Actions, Pages and JekyllJupyter Notebook

An easy-to-use blogging platform with enhanced support for Jupyter Notebooks, Word docs, and Markdown, powered by GitHub Actions.

#actions#jekyll#data-science

ML WorkspaceJupyter Notebook

Forks728

Last commit

A web-based IDE for machine learning and data science with pre-installed libraries and tools, deployable via Docker.

#jupyter-lab#data-science#deep-learning

Forks458

#data-science#deep-learning#awesome-list

Data Science

A curated list of Python software for data science, covering machine learning, deep learning, visualization, and data manipulation.

Ethen's Notebook CollectionHTML

Forks454

Last commit3 months ago

A comprehensive collection of machine learning tutorials and implementations in Python, covering algorithms from scratch to production deployment.

#python-tutorials#data-science#deep-learning

TensorWatchJupyter Notebook

Forks675

Last commit

A debugging and visualization tool for data science, deep learning, and reinforcement learning in Jupyter Notebook.

#ai#data-science#deep-learning

#apache-spark#spark#mlflow

Forks362

Last commit3 months ago

KoalasPython

Koalas provides the pandas DataFrame API on Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

Stars3.4k

Forks369