Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Gpu Acceleration

Gpu Acceleration

66 projects

Showing 30 of 66 projects

DSSTNE - Amazon's library for building Deep Learning models
DSSTNE - Amazon's library for building Deep Learning modelsC++

An open-source library for training and deploying deep learning recommendation models with sparse data at scale using multi-GPU support.

#multi-gpu#sparse-data#deep-learning
Stars4.4k
Forks727
Last commit
shimmy
shimmyRust

A lightweight, single-binary Rust inference server providing 100% OpenAI-API compatible endpoints for local GGUF models.

#safetensors#privacy-first#lora
Stars4.0k
Forks348
Last commit29 days ago
oneDNN
oneDNNC++

An open-source cross-platform performance library of basic building blocks for deep learning applications, optimized for CPUs and GPUs.

#oneapi#neural-network#jit-compilation
Stars4.0k
Forks1.1k
Last commit1 day ago
GPyTorch
GPyTorchPython

A highly efficient, scalable Gaussian process library implemented in PyTorch with GPU acceleration and modular design.

#probabilistic-modeling#gpu-acceleration#numerical-linear-algebra
Stars3.9k
Forks590
Last commit12 days ago
Neon - Python based Deep Learning Framework
Neon - Python based Deep Learning FrameworkPython

Intel's reference deep learning framework designed for high performance across CPUs, GPUs, and custom hardware.

#fast#neural-network#intel-mkl
Stars3.9k
Forks808
Last commit
implicit
implicitPython

Fast Python library for collaborative filtering recommendation algorithms on implicit feedback datasets.

#cython#recommender-system#recommender-systems
Stars3.8k
Forks628
Last commit1 year ago
BoTorch
BoTorchJupyter Notebook

A modular library for Bayesian optimization built on PyTorch, enabling efficient optimization of expensive black-box functions.

#probabilistic-models#bayesian-optimization#gpu-acceleration
Stars3.5k
Forks474
Last commit4 days ago
StringZilla
StringZillaC

A high-performance string library leveraging SIMD and SWAR to accelerate search, hashing, sorting, and edit distances across C, C++, Python, Rust, and more.

#memory-mapping#substring#information-retrieval
Stars3.4k
Forks123
Last commit1 month ago
BRAX
BRAXJupyter Notebook

A fast, differentiable physics engine built with JAX for massively parallel rigid body simulation on accelerator hardware.

#robotics#research-tool#jax
Stars3.1k
Forks338
Last commit25 days ago
ChartGPU
ChartGPUTypeScript

A WebGPU-accelerated TypeScript charting library for rendering millions of data points at 60 FPS with interactive dashboards.

#real-time-dashboards#open-source#high-performance
Stars3.0k
Forks91
Last commit5 days ago
Starling
StarlingActionScript

A lightweight, open-source 2D game engine for ActionScript 3 that leverages GPU acceleration via Stage3D for cross-platform deployment.

#actionscript-3#mobile-games#starling-framework
Stars3.0k
Forks812
Last commit2 months ago
TorchAudio
TorchAudioPython

An audio library for PyTorch providing data manipulation, transformations, and dataset loaders for machine learning applications.

#deep-learning#signal-processing#gpu-acceleration
Stars2.9k
Forks770
Last commit2 days ago
GPUImage3
GPUImage3Swift

A Swift framework for GPU-accelerated image and video processing on Apple platforms using Metal.

#realtime#ios#graphics
Stars2.9k
Forks368
Last commit1 year ago
NumPyro
NumPyroPython

A lightweight probabilistic programming library using NumPy and JAX for autograd and JIT compilation to GPU/TPU/CPU.

#variational-inference#jax#gpu-acceleration
Stars2.7k
Forks282
Last commit7 days ago
Vulkan Kompute
Vulkan KomputeC++

A general-purpose GPU compute framework built on Vulkan for cross-vendor graphics cards, enabling high-performance data processing and machine learning.

#vulkan#parallel-computing#gpu-compute
Stars2.5k
Forks190
Last commit11 days ago
PyGraphistry
PyGraphistryPython

A Python library for loading, shaping, embedding, and exploring large graphs with GPU-accelerated visualization and analytics.

#networkx#graph#graph-query-language
Stars2.5k
Forks226
Last commit1 day ago
hyperlearn
hyperlearnJupyter Notebook

HyperLearn provides 2-2000x faster machine learning algorithms with 50% less memory usage, optimized for all hardware.

#parallel-computing#high-performance#python-library
Stars2.4k
Forks158
Last commit1 year ago
PhysX
PhysXC++

A scalable multi-platform physics simulation SDK for real-time collision detection, rigid body dynamics, and character controllers.

#character-controller#simulation#collision-detection
Stars2.4k
Forks292
Last commit3 years ago
libcudacxx
libcudacxxC++

NVIDIA's implementation of the C++ Standard Library for CUDA C++ development.

#cuda#parallel-computing#high-performance-computing
Stars2.3k
Forks191
Last commit2 years ago
MetalPetal
MetalPetalObjective-C

A GPU-accelerated image and video processing framework for Apple platforms built on Metal.

#ios#filter#video-processing
Stars2.1k
Forks285
Last commit2 years ago
OpenImageDenoise
OpenImageDenoiseC++

An open-source library of high-performance, high-quality denoising filters for ray-traced images using deep learning.

#image-denoising#deep-learning#gpu-acceleration
Stars2.0k
Forks191
Last commit2 days ago
dfdx
dfdxRust

A deep learning library for Rust featuring shape-checked tensors and neural networks with compile-time safety.

#cuda#tensor-library#neural-network
Stars1.9k
Forks104
Last commit1 year ago
dfdx
dfdxRust

A deep learning library in Rust featuring shape-checked tensors and neural networks with compile-time safety.

#cuda#tensor-library#neural-network
Stars1.9k
Forks104
Last commit1 year ago
pymatting
pymattingPython

A Python library implementing multiple alpha matting algorithms for extracting foreground objects from images.

#python-library#scipy#numba
Stars1.9k
Forks226
Last commit17 days ago
Phenomenon
PhenomenonTypeScript

A fast 2kB low-level WebGL library for GPU-accelerated particle systems and high-performance visual effects.

#particles#shaders#visual-effects
Stars1.8k
Forks45
Last commit2 years ago
emacs-ng
emacs-ngEmacs Lisp

A fork of Emacs that adds modern features like TypeScript/JavaScript support via Deno, GPU-accelerated rendering with WebRender, and improved async I/O.

#emacs#webassembly#rust-integration
Stars1.8k
Forks75
Last commit1 month ago
Bender
BenderSwift

An abstraction layer over MetalPerformanceShaders for crafting and running fast neural networks on iOS using TensorFlow models.

#apple#ios#model-conversion
Stars1.8k
Forks89
Last commit2 years ago
ILGPU
ILGPUC#

A JIT compiler for writing high-performance GPU programs in .NET languages like C#, offering CUDA-level performance with C# convenience.

#cuda#parallel-computing#intel
Stars1.7k
Forks140
Last commit2 days ago
ThunderSVM
ThunderSVMC++

A fast Support Vector Machine (SVM) library that leverages GPUs and multi-core CPUs for high-performance machine learning.

#cuda#libsvm#high-performance-computing
Stars1.6k
Forks223
Last commit2 years ago
emu
emuRust

A write-once-run-anywhere GPGPU library for Rust that abstracts WebGPU for CUDA-like compute with portability across desktop, mobile, and browser.

#webgpu#compute-shader#async-gpu
Stars1.6k
Forks52
Last commit3 years ago
PreviousPage 2 of 2

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
6 years ago
5 years ago
#Machine Learning41
#Deep Learning27
#Python22
#Neural Networks19
#Gpu15
#Neural Network14
#Cuda14
#Python Library10
#Cross Platform9
#Pytorch9
#Computer Vision9
#C Plus Plus7