Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Stacks
  3. CUDA
C

CUDA

Other
177 projects1055.4k total stars216.9k total forks23 languages

Open-source projects built with CUDA

There are currently 177 open-source projects built with CUDA, with a combined total of 1055.4k GitHub stars. The most common language among these projects is Python.

Showing 176 open-source projects · page 5 of 5

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub
scirs
scirscool-japan/scirs

A comprehensive scientific computing and AI/ML library in pure Rust, offering SciPy-compatible APIs with 10-100x performance gains.

23733Rust
2 days ago
Calibnet
Calibnetepiception/CalibNet

A self-supervised deep learning model for extrinsic calibration between LiDAR and camera sensors using 3D spatial transformer networks.

22956Python
2 years ago
CAPTCHA-breaking
CAPTCHA-breakinglllcho/CAPTCHA-breaking

A Python-based CAPTCHA breaking solution using Keras and OpenCV, developed for a data science competition.

22281Python
10 years ago
DBNet
DBNetdriving-behavior/DBNet

A large-scale driving behavior dataset with LiDAR point clouds, dashboard videos, and sensor data for autonomous driving research.

22149Python
7 years ago
captcha.irctc
captcha.irctcarunpatala/captcha.irctc

A deep learning model that reads IRCTC captchas with 98% accuracy, demonstrating their vulnerability to automated booking.

21540Lua
5 years ago
cunn
cunntorch/cunn

CUDA backend implementation for Torch's neural network package, enabling GPU acceleration for deep learning models.

213173Cuda
6 years ago
ClojureCUDA
ClojureCUDAuncomplicate/clojurecuda

A Clojure library for GPU-accelerated computing using NVIDIA CUDA, enabling high-performance parallel processing.

20612C
20 days ago
Usiigaci
Usiigacioist/usiigaci

A semi-automated pipeline for instance-aware cell segmentation, tracking, and migration analysis in phase contrast microscopy using Mask R-CNN.

20570Jupyter Notebook
5 years ago
KFusion: Implementation of KinectFusion
KFusion: Implementation of KinectFusionGerhardR/kfusion

A CUDA-based implementation of KinectFusion for real-time dense surface reconstruction and tracking using a Kinect camera.

19882C++
11 years ago
shainet
shainetNeuraLegion/shainet

A pure Crystal machine learning library for building and training neural networks with CPU/GPU support and PyTorch compatibility.

19519Crystal
5 months ago
Dockerface
Dockerfacenatanielruiz/dockerface

A Docker container for face detection using Faster R-CNN deep learning, processing videos and images with bounding box outputs.

19132Dockerfile
6 years ago
DAGAN
DAGANnebulaV/DAGAN

A deep learning model using generative adversarial networks for fast compressed sensing MRI reconstruction.

18053Python
7 years ago
MotionNet
MotionNetpxiangwu/MotionNet

A deep learning model for joint perception and motion prediction in autonomous driving using bird's eye view maps.

17325
6 years ago
stochastic-rs
stochastic-rsrust-dd/stochastic-rs

A high-performance Rust library for simulating stochastic processes, with applications in quantitative finance, statistical modeling, and synthetic data generation.

1677Rust
2 days ago
N2D2
N2D2CEA-LIST/N2D2

An open-source CAD framework for designing, simulating, and deploying deep neural networks on embedded platforms.

16039C
1 year ago
docker-hashcat
docker-hashcatdizcza/docker-hashcat

Dockerized hashcat with multiple backends (CUDA, OpenCL, POCL) for GPU-accelerated password recovery and hash cracking.

15945Dockerfile
9 months ago
DH3D
DH3DJuanDuGit/DH3D

A deep learning approach that unifies global place recognition and local 6DoF pose refinement for robust relocalization in large-scale 3D point clouds.

15817Python
5 years ago
MxNet.Sharp
MxNet.Sharptech-quantum/MxNet.Sharp

.NET Standard bindings for Apache MXNet, providing C# developers with NumPy-compatible APIs for machine learning model development, training, and deployment.

1517C#
3 years ago
neural-dream
neural-dreamProGamerGov/neural-dream

A PyTorch implementation of the DeepDream algorithm for generating psychedelic, dream-like images from neural network activations.

14720Python
4 years ago
Merlin
Merlinhshindo/Merlin.jl

A fast, flexible, and compact deep learning framework for Julia that runs on CPU and CUDA GPU.

14610Julia
6 years ago
SPOC
SPOCmathiasbourgoin/SPOC

A PPX-based DSL for writing GPU kernels in OCaml syntax that compiles to multiple backends (CUDA, OpenCL, Vulkan, Metal).

14311HTML
2 days ago
DCompute
DComputelibmir/dcompute

A set of libraries enabling native execution of D code on GPUs and other accelerators via OpenCL and CUDA runtimes.

14333D
23 days ago
mmWave-localization-learning
mmWave-localization-learninggante/mmWave-localization-learning

A machine learning algorithm for accurate, energy-efficient outdoor positioning using 5G mmWave beamformed fingerprints.

13243Python
1 year ago
GeneCompass
GeneCompassxCompass-AI/GeneCompass

A knowledge-informed cross-species foundation model pre-trained on over 120 million human and mouse single-cell transcriptomes to decipher universal gene regulatory mechanisms.

11723Jupyter Notebook
3 months ago
ANNetGPGPU
ANNetGPGPUANNetGPGPU/ANNetGPGPU

A GPU-accelerated (CUDA) C++ template library for building and training artificial neural networks, including self-organizing maps and back-propagation networks.

11324C++
4 years ago
CellPLM
CellPLMOmicsML/CellPLM

A pre-trained language model for single-cell RNA sequencing data that encodes cell-cell relations and accelerates inference for downstream tasks.

10315Jupyter Notebook
2 years ago
CUB
CUBNVlabs/cub

A library of reusable CUDA C++ software components for parallel algorithms like sorting, prefix scan, reduction, and histogram.

8747Cuda
2 years ago
DockerDL
DockerDLmatifali/dockerdl

A pre-configured Docker image with deep learning frameworks, data science tools, and GPU support for rapid environment setup.

8511Dockerfile
3 months ago
SMAP
SMAPjries/SMAP

A modular MATLAB-based platform for analyzing super-resolution microscopy (SMLM) data with GPU-accelerated fitting.

8425MATLAB
3 days ago
RootPainter
RootPainterAbe404/root_painter

A GUI-based tool for training deep neural networks to segment biological images using corrective annotation.

7826Python
1 month ago
RustTensor
RustTensorramsyana/RustTensor

A learning-focused, high-performance tensor computation library built from scratch in Rust with automatic differentiation and CPU/CUDA backends.

771Rust
1 year ago
im2im
im2imzsdonghao/Unsup-Im2Im

An implementation of unsupervised image-to-image translation using Generative Adversarial Networks (GANs).

7216Python
5 years ago
1
2
3
4
5