Showing 36 of 484 projects
A Ruby kernel for Jupyter notebooks, enabling interactive data science and computational workflows in Ruby.
A fast, ergonomic machine learning library for Rust with broad algorithm coverage and WASM-first defaults.
Learn statistics through Python with real-world examples like analyzing marijuana price data across US states.
A Ruby machine learning library with a Scikit-Learn-like interface for classification, regression, clustering, and dimensionality reduction.
A curated collection of resources for Go-based data analysis, visualization, machine learning, and data science.
An R package for the quantitative analysis of textual data, providing comprehensive tools for natural language processing and text management.
An automated feature generation framework for tabular data that discovers expert-level features to boost machine learning model performance.
A curated list of awesome cheminformatics software, libraries, resources, and tools, primarily command-line based and open-source.
A Jupyter kernel for Clojure, enabling Clojure code execution in Jupyter Lab, Notebook, and Console.
A curated list of resources for R Shiny, including tutorials, packages, deployment guides, and app examples.
Convert IPython/Jupyter notebooks to markdown and back, enabling seamless editing of notebooks as markdown files.
A JupyterLab extension that integrates GPT-4 as a code interpreter, translating natural language to Python and executing it automatically.
A comprehensive statistical computation library for Rust, providing distributions, functions, and utilities for scientific computing.
A pure Java machine learning library with no external dependencies, offering a wide collection of algorithms and parallel execution support.
An open-source image analysis software package for plant phenotyping using computer vision.
A modular deep learning framework for PyTorch to build neural networks on heterogeneous tabular data.
An R package providing comprehensive historical soccer match datasets and analysis functions for European and MLS leagues.
An R binding package for calling Google Earth Engine API from within R, integrating with the R spatial ecosystem.
A biomedical knowledge graph integrating 20 resources to describe 17,080 diseases with over 4 million relationships across ten biological scales.
A Neovim plugin that provides real-time, bidirectional synchronization with Jupyter Notebook using Selenium automation.
A Python library for introductory data science education, developed for Berkeley's Data 8 course.
An R package for detecting statistically significant breakpoints in time series using robust energy statistics.
A Python library that brings R's dplyr data manipulation syntax to pandas DataFrames using a pipe operator.
An open-source toolkit for auditing bias and experimenting with fairness methods in machine learning models.
A collection of R packages for interacting with Hadoop ecosystems, enabling big data analysis from R.
A Jupyter Notebook kernel for interactive data exploration and analysis using Apache Spark with Scala.
A high-performance, functional tabular data processing library for Clojure, similar to Python's Pandas or R's data.table.
A Nix-based framework for creating declarative and reproducible Jupyter environments with configurable kernels and extensions.
A collection of Jupyter notebooks providing examples and tutorials for the Bokeh interactive visualization library.
A tool to package, serve, and deploy any ML model on any platform using a GitOps approach.
A modern C++ toolkit for text retrieval and analysis, featuring indexing, ranking, topic modeling, classification, and language models.
IPython-based environment for reproducible machine learning research with unified wrappers for multiple ML libraries.
A Python package for stacking (stacked generalization) with both functional and scikit-learn compatible APIs.
A curated list of proven AI use cases that generate business value across departments and industries.
A hyperparameter-free gradient boosting machine with a simple budget parameter, built for high performance with Rust and bindings for Python and R.
A quick reference guide to the most commonly used patterns and functions in PySpark SQL.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.