Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

CUDA

Other
177 projects · 1055.4k total stars · 216.9k total forks · 23 languages

Open-source projects built with CUDA

There are currently 177 open-source projects built with CUDA, with a combined total of 1055.4k GitHub stars. The most common language among these projects is Python.

Showing 176 open-source projects · page 3 of 5

TensorComprehensions · facebookresearch/TensorComprehensions

A domain-specific language and C++ library for automatically synthesizing high-performance machine learning kernels.

1.8k stars · 213 forks · C++
3 years ago
Votenet · facebookresearch/votenet

An end-to-end 3D object detection network that uses deep point set networks and Hough voting to directly detect objects in point clouds.

1.8k stars · 387 forks · Python
4 years ago
VkFFT · DTolm/VkFFT

A high-performance GPU-accelerated Fast Fourier Transform library supporting Vulkan, CUDA, HIP, OpenCL, Level Zero, and Metal backends.

1.7k stars · 134 forks · C++
2 months ago
MarS · microsoft/MarS

A financial market simulation engine powered by a generative foundation model for realistic, interactive, and controllable order generation.

1.7k stars · 183 forks · Python
1 month ago
charlesq34/frustum-pointnets

A deep learning pipeline for 3D object detection from RGB-D data by combining 2D detectors with PointNet-based 3D processing.

1.7k stars · 531 forks · Python
6 years ago
fast_gicp · SMRT-AIST/fast_gicp

A collection of high-performance GICP-based point cloud registration algorithms with multi-threaded and GPU-accelerated implementations.

1.6k stars · 370 forks · C++
1 year ago
ThunderSVM · Xtra-Computing/thundersvm

A fast Support Vector Machine (SVM) library that leverages GPUs and multi-core CPUs for high-performance machine learning.

1.6k stars · 222 forks · C++
2 years ago
QingyongHu/RandLA-Net

An efficient neural network for semantic segmentation of large-scale 3D point clouds using random sampling.

1.5k stars · 334 forks · Python
2 years ago
Speech Note · mkiol/dsnote

A Linux desktop app for offline note-taking, reading, and translation using speech-to-text, text-to-speech, and machine translation.

1.5k stars · 63 forks · C++
2 days ago
MatX · NVIDIA/MatX

A modern C++20 GPU numerical computing library with Python-like syntax for near-native performance on NVIDIA GPUs.

1.4k stars · 117 forks · C++
1 day ago
Personae · Ceruleanacg/Personae

A repository implementing Deep Reinforcement Learning and Supervised Learning methods with a simulated financial market environment for quantitative trading.

1.4k stars · 340 forks · Python
7 years ago
Deeppy · andersbll/deeppy

A Pythonic deep learning framework built on NumPy with optional CUDA acceleration.

1.4k stars · 299 forks · Python
5 years ago
Image Analogy · msracver/Deep-Image-Analogy

A deep learning technique for finding semantically meaningful dense correspondences between images to enable visual attribute transfer.

1.4k stars · 231 forks · C++
4 years ago
Mocha.jl · pluskid/Mocha.jl

A deep learning framework for Julia inspired by Caffe, featuring modular architecture and multiple backends.

1.3k stars · 245 forks · Julia
7 years ago
CTPN · tianzhi0549/CTPN

Scene text detection using Connectionist Text Proposal Network (CTPN) for detecting text lines in natural images.

1.3k stars · 530 forks · Jupyter Notebook
4 years ago
cortex · originrose/cortex

A Clojure library for neural networks, regression, and feature learning with GPU acceleration support.

1.3k stars · 109 forks · Clojure
7 years ago
hyperpose · tensorlayer/hyperpose

A library for building high-performance custom human pose estimation applications with real-time inference and flexible model development.

1.3k stars · 272 forks · Python
3 years ago
stdgpu · stotko/stdgpu

A C++17 library providing efficient STL-like data structures (vector, unordered_map, etc.) for GPU programming with CUDA, OpenMP, and HIP backends.

1.3k stars · 99 forks · C++
1 month ago
hebel · hannes-brt/hebel

A GPU-accelerated deep learning library for Python using CUDA via PyCUDA, implementing neural networks with various training methods.

1.2k stars · 117 forks · Python
5 years ago
hyprwhspr · goodroot/hyprwhspr

A fast, accurate, and private native speech-to-text tool for Linux, offering system-wide dictation with local or cloud backends.

1.1k stars · 76 forks · Python
4 days ago
Cupoch · neka-nat/cupoch

A CUDA-accelerated library for rapid 3D data processing in robotics, enabling GPU-powered SLAM, collision avoidance, and path planning.

1.1k stars · 118 forks · C++
2 months ago
drprojects/superpoint_transformer

A PyTorch framework for efficient 3D semantic and panoptic segmentation using superpoint-based transformer architectures.

1.0k stars · 132 forks · Python
1 month ago
DetectAndTrack · facebookresearch/DetectAndTrack

A CVPR 2018 algorithm for efficient multi-person pose estimation and tracking in videos, ranking first in the ICCV 2017 PoseTrack challenge.

1.0k stars · 185 forks · Python
7 years ago
RLtools · rl-tools/rl-tools

A high-performance, portable deep reinforcement learning library for continuous control, optimized for speed across CPUs, GPUs, and microcontrollers.

987 stars · 55 forks · C++
2 months ago
Penglab · mxrch/penglab

A Google Colab notebook setup for high-performance hash cracking and penetration testing tools.

969 stars · 143 forks · Jupyter Notebook
2 years ago
BitCracker · e-ago/bitcracker

An open-source GPU-accelerated password cracking tool for BitLocker-encrypted storage devices using dictionary attacks.

962 stars · 207 forks · C
2 years ago
deepdrive · deepdrive/deepdrive

An open-source simulator for experimenting with and advancing self-driving AI, accessible to anyone with a PC.

924 stars · 150 forks · Python
2 years ago
Deepnet · nitishsrivastava/deepnet

GPU-accelerated Python implementation of six fundamental deep learning algorithms using CUDA libraries.

895 stars · 433 forks · Python
12 years ago
cuda-api-wrappers · eyalroz/cuda-api-wrappers

Thin, unified C++ wrappers for NVIDIA's CUDA APIs (Runtime, Driver, NVRTC, NVTX) that improve safety and ease of use.

890 stars · 87 forks · C++
4 days ago
neural-style-pt · ProGamerGov/neural-style-pt

A PyTorch implementation of neural style transfer, combining the content of one image with the artistic style of another.

860 stars · 170 forks · Python
3 years ago
torch-rb · ankane/torch-rb

A Ruby deep learning library powered by LibTorch, providing a PyTorch-like API for Ruby developers.

832 stars · 37 forks · Ruby
26 days ago
Torch.rb · ankane/torch.rb

A deep learning library for Ruby that provides a native interface to LibTorch, enabling GPU-accelerated neural network development.

832 stars · 37 forks · Ruby
26 days ago
Objax · google/objax

An object-oriented machine learning framework built on JAX, designed for simplicity and readability in research.

773 stars · 72 forks · Python
2 years ago
DNABERT · jerryji1993/DNABERT

A pre-trained BERT model designed for DNA sequence analysis, enabling genome understanding tasks like classification and motif discovery.

756 stars · 179 forks · Python
4 months ago
OpenChem · Mariewelt/OpenChem

A deep learning toolkit for computational chemistry and drug design research with PyTorch backend.

746 stars · 120 forks · Python
2 years ago
PRBonn/OverlapNet

A Siamese neural network for LiDAR-based loop closing and localization by predicting scan overlap and relative yaw angle.

730 stars · 117 forks · Python
3 years ago