Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Tags
  3. Cuda

Cuda

44 projects

Showing 36 of 44 projects

llama.cpp
llama.cppC++

A C/C++ library for efficient, cross-platform LLM inference with extensive hardware support and quantization.

#cuda#ggml#metal
Stars105.8k
Forks17.2k
Last commit1 day ago
vllm
vllmPython

A high-throughput, memory-efficient inference and serving engine for large language models (LLMs).

#distributed-inference#transformer#cuda
Stars77.8k
Forks16.0k
Last commit1 day ago
Caffe Model Zoo
Caffe Model ZooC++

A fast open framework for deep learning with a focus on expression, speed, and modularity.

#cuda#deep-learning#neural-networks
Stars34.6k
Forks18.5k
Last commit1 year ago
Openpose
OpenposeC++

Real-time multi-person keypoint detection library for body, face, hands, and foot estimation.

#cuda#pose-estimation#human-behavior-understanding
Stars34.0k
Forks8.1k
Last commit1 year ago
GitHub repository
GitHub repositoryC++

An open-source, high-performance platform for developing, testing, and deploying autonomous vehicles.

#lidar#robotics#autonomous-driving
Stars26.6k
Forks9.9k
Last commit8 days ago
Darknet
DarknetC

An open source neural network framework in C and CUDA, known for YOLO real-time object detection models.

#cuda#deep-learning#c-language
Stars26.4k
Forks21.1k
Last commit2 years ago
sglang
sglangPython

A high-performance serving framework for large language models and multimodal models, delivering low-latency and high-throughput inference.

#transformer#cuda#llm-serving
Stars26.3k
Forks5.5k
Last commit1 day ago
Buzz
BuzzPython

An offline desktop application for transcribing and translating audio/video files, live recordings, and YouTube links using OpenAI's Whisper.

#vulkan#cuda#desktop-application
Stars18.8k
Forks1.4k
Last commit2 days ago
CNTK - Microsoft Cognitive Toolkit
CNTK - Microsoft Cognitive ToolkitC++

A unified deep learning toolkit for describing neural networks as computational graphs, supporting feed-forward DNNs, CNNs, and RNNs/LSTMs.

#cntk#cognitive-toolkit#cuda
Stars17.6k
Forks4.2k
Last commit3 years ago
nvidia-docker
nvidia-docker

A deprecated wrapper that enabled Docker containers to access NVIDIA GPU resources.

#nvidia-docker#cuda#container-runtime
Stars17.5k
Forks2.1k
Last commit2 years ago
Kaldi
KaldiShell

A comprehensive open-source toolkit for speech recognition research and development.

#cuda#research-toolkit#speaker-id
Stars15.4k
Forks5.4k
Last commit7 months ago
TensorRT
TensorRTC++

NVIDIA's SDK for high-performance deep learning inference optimization and deployment on NVIDIA GPUs.

#cuda#neural-network#nvidia
Stars12.9k
Forks2.3k
Last commit10 days ago
Taskflow
TaskflowC++

A fast, expressive, and header-only C++ library for building task-parallel programs with static, dynamic, and conditional task graphs.

#work-stealing#threadpool#cuda
Stars11.9k
Forks1.4k
Last commit2 days ago
cupy
cupyPython

A NumPy/SciPy-compatible array library for GPU-accelerated computing with Python, supporting NVIDIA CUDA and AMD ROCm.

#cuda#scientific-computing#high-performance-computing
Stars10.9k
Forks1.0k
Last commit2 days ago
cudf
cudfC++

A GPU-accelerated DataFrame library for tabular data processing, part of the RAPIDS data science suite.

#cudf#cuda#apache-arrow
Stars9.6k
Forks1.0k
Last commit1 day ago
gocv
gocvGo

Go language bindings for OpenCV 4, enabling computer vision applications with support for CUDA, DNN, and OpenVINO.

#cuda#video-processing#opencv
Stars7.4k
Forks902
Last commit2 months ago
Chainer
ChainerPython

A flexible Python deep learning framework using define-by-run dynamic computational graphs for neural network research.

#research-tool#cuda#chainer
Stars5.9k
Forks1.4k
Last commit2 years ago
leaf
leafRust

An open-source machine learning framework for building classical, deep, or hybrid ML applications with a focus on performance and portability.

#cuda#opencl#deep-learning
Stars5.5k
Forks269
Last commit2 years ago
cuML
cuMLC++

A suite of GPU-accelerated machine learning algorithms with scikit-learn compatible APIs for 10-50x faster performance on large datasets.

#cuda#data-science#nvidia
Stars5.2k
Forks622
Last commit1 day ago
GitHub repository
GitHub repositoryPython

A PyTorch library providing GPU-accelerated tools for 3D deep learning, including differentiable rendering and geometric operations.

#cuda#rasterization#differentiable-lighting
Stars5.1k
Forks619
Last commit2 days ago
Thrust
ThrustC++

A C++ parallel algorithms library that enables high-performance computing on GPUs and multicore CPUs with a productivity-focused interface.

#cuda#parallel-computing#high-performance-computing
Stars5.0k
Forks759
Last commit2 years ago
ArrayFire
ArrayFireC++

A general-purpose tensor library for parallel computing across CPUs, GPUs, and hardware accelerators.

#oneapi#cuda#scientific-computing
Stars4.9k
Forks548
Last commit1 month ago
NCCL
NCCLC++

A library of optimized communication primitives for multi-GPU and multi-node collective operations.

#multi-gpu#cuda#distributed-training
Stars4.6k
Forks1.2k
Last commit1 day ago
jetson-containers
jetson-containersJupyter Notebook

A modular container build system providing the latest AI/ML packages for NVIDIA Jetson and JetPack-L4T.

#robotics#cuda#ros-containers
Stars4.6k
Forks823
Last commit4 days ago
Warp-CTC
Warp-CTCCuda

A fast parallel implementation of the Connectionist Temporal Classification (CTC) loss function for CPU and GPU.

#cuda#parallel-computing#torch-binding
Stars4.1k
Forks1.0k
Last commit2 years ago
Lygia
LygiaGLSL

A granular, multi-language shader library for real-time graphics, supporting GLSL, HLSL, Metal, WGSL, and CUDA.

#cuda#real-time-graphics#library
Stars3.3k
Forks213
Last commit1 month ago
Remotery
RemoteryC

A realtime CPU/GPU profiler hosted in a single C file with a remote web viewer for performance analysis.

#c-library#vulkan#cuda
Stars3.3k
Forks284
Last commit1 year ago
flownet2-pytorch
flownet2-pytorchPython

PyTorch implementation of FlowNet 2.0 for optical flow estimation using deep neural networks.

#cuda#deep-learning#neural-networks
Stars3.3k
Forks747
Last commit25 days ago
nnabla
nnablaPython

A deep learning framework for research, development, and production with flexible Python API and C++ core.

#cuda#model-training#deep-learning
Stars2.8k
Forks335
Last commit7 months ago
Kokkos
KokkosC++

A C++ programming model for writing performance-portable applications targeting all major HPC platforms.

#cuda#sycl#parallel-computing
Stars2.5k
Forks494
Last commit2 days ago
darknet_ros
darknet_rosC++

A ROS package for real-time object detection in camera images using YOLO (V3) on GPU and CPU.

#cuda#camera#autonomous-robots
Stars2.4k
Forks1.2k
Last commit1 year ago
darknet_ros
darknet_rosC++

A ROS package for real-time object detection in camera images using YOLO (V3) on GPU and CPU.

#robotics#cuda#opencv
Stars2.4k
Forks1.2k
Last commit1 year ago
EGO-Planner
EGO-PlannerC++

A lightweight gradient-based local planner for quadrotors that eliminates ESDF construction, achieving planning times around 1ms.

#cuda#gradient-based-optimization#real-time-planning
Stars2.4k
Forks385
Last commit1 year ago
libcudacxx
libcudacxxC++

NVIDIA's implementation of the C++ Standard Library for CUDA C++ development.

#cuda#parallel-computing#high-performance-computing
Stars2.3k
Forks191
Last commit2 years ago
envd
envdGo

A command-line tool for creating reproducible, container-based development environments for AI/ML workflows.

#cuda#hacktoberfest#developer-tools
Stars2.2k
Forks167
Last commit14 days ago
RAPIDS cuGraph
RAPIDS cuGraphCuda

A collection of GPU-accelerated graph analytics libraries for creating, manipulating, and executing scalable graph algorithms.

#cuda#high-performance-computing#graph
Stars2.2k
Forks350
Last commit1 day ago
Page 1 of 2Next

Related Tags

#Machine Learning17#Deep Learning15#Gpu Acceleration14#Gpu13#Computer Vision11#High Performance Computing11#C Plus Plus11#Neural Networks10#Parallel Computing10#Gpu Computing8#Nvidia7#Python6
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub