Cpu Optimization

12 projects

Showing 12 of 12 projects

BigDLPython

An LLM acceleration library for Intel XPU (GPU, NPU, CPU) to speed up local inference and finetuning of popular models.

#finetuning#llm-acceleration#cpu-optimization

Stars8.9k

Forks1.4k

Last commit5 months ago

oneDNNC++

An open-source cross-platform performance library of basic building blocks for deep learning applications, optimized for CPUs and GPUs.

#oneapi#neural-network#jit-compilation

A compiler for a C-based SPMD language that generates high-performance SIMD code for CPUs and GPUs.

#programming-language#parallel-computing#high-performance-computing

Stars2.9k

Forks348

Last commit5 days ago

VcC++

A portable C++ library providing SIMD vector types for explicit data-parallel programming with zero-overhead abstractions.

#parallel-computing#high-performance-computing#simd-instructions

Stars1.5k

Forks151

Last commit14 days ago

Intel(R) Extension for Scikit-learnPython

A free software AI accelerator that speeds up scikit-learn applications by 10-100x on CPUs and GPUs with no code changes.

#oneapi#ai-machine-learning#ai-accelerator

A C++ template library providing high-performance SIMD-accelerated sorting algorithms for integers, floats, and custom objects.

#template-library#parallel-computing#high-performance-computing

A Pascal-based deep learning neural network API optimized for AVX/AVX2/AVX512 and OpenCL, supporting AMD, Intel, and NVIDIA hardware.

#free-pascal#avx-optimization#opencl

A fast, header-only C/C++ library for counting 1 bits in arrays using optimized CPU instructions like POPCNT, AVX2, AVX512, NEON, and SVE.

#c-library#simd#bitcount

Stars368

Forks44

Last commit13 days ago

whisper-openvinoJupyter Notebook

A fork of OpenAI's Whisper speech recognition models optimized with OpenVINO backend for faster CPU inference.

#intel#cpu-optimization#asr

Stars184

Forks16

Last commit2 years ago

gl_vk_threaded_cadsceneC++

A deprecated sample comparing OpenGL and Vulkan rendering techniques for CAD scenes using multi-threaded command buffer generation.

#vulkan#graphics#opengl

Stars168

Forks27

Last commit1 year ago

FastMM4-AVXPascal

A high-performance fork of FastMM4 with AVX/AVX2/AVX512 support, efficient synchronization, and FreePascal compatibility.

#memory-manager#memory-management#performance-optimization

Stars154

Forks24

Last commit3 months ago

Stardust from IntelC

A Vulkan sample application that renders 200,000 animated particles using multithreaded draw calls to demonstrate low CPU overhead.

#vulkan#performance-demo#cpu-optimization

Stars119

Forks13

Last commit3 years ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub