Llm Inference

10 projects

Showing 10 of 10 projects

llama.cppC++

A C/C++ library for efficient, cross-platform LLM inference with extensive hardware support and quantization.

A high-throughput, memory-efficient inference and serving engine for large language models (LLMs).

#distributed-inference#transformer#cuda

Run large language models (LLMs) privately on everyday desktops and laptops without requiring API calls or GPUs.

#desktop-application#ai-chat#model-inference

Stars77.4k

Forks8.3k

Last commit11 months ago

bitnet.cppPython

Official inference framework for 1-bit LLMs, enabling fast and lossless CPU/GPU inference with significant speed and energy efficiency gains.

#gpu-inference#transformer-models#cpu-inference

Stars38.5k

Forks3.5k

Last commit1 month ago

BentoMLPython

A Python library for building production-ready model inference APIs, job queues, and multi-model serving systems for AI applications.

#llm-serving#python-library#deep-learning

Stars8.6k

Forks950

Last commit8 days ago

mistral.rsRust

A fast, flexible, and hardware-aware LLM inference engine with zero-config support for any Hugging Face model.

#agentic-ai#quantization#llm

Stars7.0k

Forks581

Last commit9 days ago

planoRust

An AI-native proxy and data plane for agentic applications, providing built-in orchestration, safety, observability, and smart LLM routing.

#proxy#observability#gateway

A fast and comprehensive machine learning framework for Java, Scala, and Kotlin with state-of-the-art algorithms and data visualization.

#deep-learning#interpolation#classification

A lightweight, single-binary Rust inference server providing 100% OpenAI-API compatible endpoints for local GGUF models.

#safetensors#privacy-first#lora

Stars4.0k

Forks348

Last commit29 days ago

ruvectorRust

A self-learning vector database with graph intelligence, local AI, and PostgreSQL integration, built for real-time adaptation.

#ai#agentic-ai#self-learning

Stars3.8k

Forks472

Last commit1 day ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub