Showing 36 of 52 projects
A refreshingly simple, data-driven game engine built in Rust, offering a complete 2D and 3D feature set.
A prototype web browser engine written in Rust, designed for parallel processing and embedding in applications.
A scalable, portable, and distributed gradient boosting library for efficient machine learning across multiple languages and platforms.
A C++ parallel computing and asynchronous networking framework for building high-performance, high-concurrency backend services.
A lightweight data-parallelism library for Rust that makes it easy to convert sequential computations into parallel ones.
A fast, industrial-strength, lock-free concurrent queue for C++11 supporting multi-producer and multi-consumer patterns.
A fast, expressive, and header-only C++ library for building task-parallel programs with static, dynamic, and conditional task graphs.
A drop-in replacement for pandas that scales data analysis workflows to use all CPU cores and handle out-of-memory datasets.
A scientific computing framework with wide support for machine learning algorithms, built around multi-dimensional tensor operations.
A simple C++11 thread pool implementation for parallel task execution in C++ applications.
A data-oriented and data-driven game engine written in Rust, designed for extreme multithreading and clean architecture.
A Python framework for rapid prototyping and testing of evolutionary algorithms, including genetic algorithms, genetic programming, and evolution strategies.
A Python framework for rapid prototyping and testing of evolutionary algorithms, including genetic algorithms, genetic programming, and evolution strategies.
A cryptographic hash function that is significantly faster than SHA-256, highly parallelizable, and serves as a PRF, MAC, KDF, and XOF.
A comprehensive suite of modern concurrency abstractions and thread-safe utilities for Ruby, inspired by Erlang, Clojure, and Java.
A C++ parallel algorithms library that enables high-performance computing on GPUs and multicore CPUs with a productivity-focused interface.
A general-purpose tensor library for parallel computing across CPUs, GPUs, and hardware accelerators.
A JAX implementation of OpenAI's Whisper model offering up to 70x faster transcription on TPUs.
A library of optimized communication primitives for multi-GPU and multi-node collective operations.
A Python package for working with labeled multi-dimensional arrays, inspired by pandas and tailored for scientific data.
A fast parallel implementation of the Connectionist Temporal Classification (CTC) loss function for CPU and GPU.
An open-source cluster resource management and job scheduling system for Linux-based high-performance computing.
A high-performance R package for fast data manipulation of large datasets, extending data.frame with concise syntax and memory efficiency.
A library that simplifies using web workers and worker threads across Node.js, browsers, and Electron with a uniform API.
A family of header-only C++ hashmap and btree containers offering drop-in replacements for std::unordered_map/std::map with superior speed and memory efficiency.
An open-source library for high-performance subdivision surface evaluation on CPU and GPU, matching Pixar's Renderman precision.
A fast, lightweight, modern, and easy-to-use C++17/20/23 thread pool library for high-performance parallel computing.
A compiler for a C-based SPMD language that generates high-performance SIMD code for CPUs and GPUs.
High-performance C++ library for multiphysics and multibody dynamics simulations, including rigid/flexible bodies, granular dynamics, and fluid-solid interaction.
A modern C++ concurrency library providing tasks, executors, timers, and C++20 coroutines for writing highly concurrent applications.
A fast, non-deadlocking parallel object cache for iOS, tvOS, watchOS, and macOS, designed to persist expensive-to-reproduce temporary objects.
A Python package that automatically accelerates pandas and Modin DataFrame apply operations by choosing the fastest available method.
A parallel Entity-Component System (ECS) library for Rust, designed for high flexibility and performance.
Portable implementation of Grand Central Dispatch (GCD) for concurrent code execution on multicore hardware across Swift platforms.
An open-source implementation of the Message Passing Interface (MPI) specification for high-performance computing.
A C++ programming model for writing performance-portable applications targeting all major HPC platforms.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.