Showing 36 of 506 projects
A deep learning framework for Julia with GPU support and automatic differentiation using dynamic computational graphs.
A curated list of awesome Python frameworks, libraries, software, and resources for chemistry and cheminformatics.
A visual, low-code data preparation tool that generates Python code for ETL, reporting, and AI-assisted workflows.
A curated collection of open-source computer vision pre-trained models across TensorFlow, Keras, PyTorch, Caffe, and MXNet frameworks.
Jupyter magics and kernels for interactively working with remote Spark clusters via Livy, Lighter, or Ilum.
A curated collection of tools, tutorials, code, and resources for Earth Observation and geospatial satellite imagery analysis.
A curated collection of 500+ resources for data analysis and data science, covering Python, SQL, ML, visualization, roadmaps, and interview prep.
A free software AI accelerator that speeds up scikit-learn applications by 10-100x on CPUs and GPUs with no code changes.
A Java/Groovy/JavaFX data visualization tool for ETL, machine learning, and publishing web visualizations.
Transpile trained scikit-learn estimators to C, Java, JavaScript, Go, PHP, and Ruby for embedded systems and performance-critical applications.
A real-time AI lakehouse platform with a Python-centric feature store and comprehensive MLOps capabilities.
A lightweight and intuitive Go library for data manipulation, statistics, and machine learning using DataFrames.
A web-based tool for automated hyperparameter tuning and stacked ensemble creation in Python.
A general-purpose machine learning library for Rust, focusing on speed and ease of use with minimal dependencies.
A curated list of resources for random forest and other tree-based machine learning methods.
A Python framework for scalable time series forecasting using machine learning models, designed for production environments.
A ranked list of 300+ awesome Jupyter Notebook, Hub, and Lab projects (extensions, kernels, tools) updated weekly.
A flexible Python framework for fast network flow data analysis, offering encrypted application identification, statistical feature extraction, and extensibility via plugins.
A Python library that adds dynamic data aggregation to Plotly figures for scalable visualization of large time series.
A comprehensive Julia package for probability distributions, providing properties, PDFs, sampling, and maximum likelihood estimation.
A Neovim plugin for interactively running code with Jupyter kernels, providing a REPL and notebook-like experience directly in the editor.
A Bayesian marketing analytics toolbox for Media Mix Modeling (MMM), Customer Lifetime Value (CLV), and customer choice analysis.
A cleaned and normalized time series dataset of global COVID-19 confirmed cases, deaths, and recoveries, updated daily.
renv creates isolated, portable, and reproducible project environments for R by managing private package libraries and lockfiles.
A Ruby library that enables direct calling of Python functions and modules with automatic type conversion.
An open-source machine learning system for the end-to-end data science lifecycle from data preparation to model serving.
An open-source Java framework for rapid development of machine learning and statistical applications with large dataset support.
A modern, object-oriented machine learning framework for R, providing efficient building blocks for ML workflows.
A Go kernel for Jupyter notebooks that compiles each cell for fast execution and full Go compatibility.
A Python package providing specialized statistical algorithms for graph and network analysis.
A Python library for building Generalized Additive Models (GAMs) with a scikit-learn-like API, emphasizing interpretability and performance.
A meta gem that bundles scientific computing and visualization libraries for Ruby, enabling data analysis and plotting.
A PyTorch-based framework for training and validating models that produce high-quality embeddings for metric learning and retrieval tasks.
An R interface for Apache Spark that enables distributed data processing, machine learning, and SQL queries using familiar R syntax.
A scikit-learn compatible Python module for multi-label classification tasks.
A strongly-typed Scala API for TensorFlow, providing functionality similar to the official Python API with additional features.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.