Showing 19 of 19 projects
A platform to run, manage, and serve open-source large language models (LLMs) locally or on your own infrastructure.
A high-throughput, memory-efficient inference and serving engine for large language models (LLMs).
An open platform for training, serving, and evaluating large language model-based chatbots.
A minimalist, high-performance machine learning framework for Rust with a focus on serverless inference and GPU support.
A composable, modular, and scalable machine learning toolkit for building AI platforms on Kubernetes.
A low-code declarative framework for building custom LLMs, neural networks, and other AI models with YAML configurations.
An open-source inference serving platform for deploying AI models from multiple frameworks across cloud, data center, and edge devices.
A practical booklet covering the four main steps of designing machine learning systems, accompanied by 27 interview questions.
A Python library for building production-ready model inference APIs, job queues, and multi-model serving systems for AI applications.
A platform for deploying, managing, and scaling machine learning models in production on AWS infrastructure.
A fast, flexible, and hardware-aware LLM inference engine with zero-config support for any Hugging Face model.
An open-source MLOps/LLMOps suite for experiment management, data management, pipelines, orchestration, scheduling, and model serving.
A curated collection of resources for building, training, serving, and optimizing production-grade Large Language Model applications.
An MLOps framework to package, deploy, monitor, and manage thousands of production machine learning models on Kubernetes.
An open-source deep learning API and server written in C++ that supports multiple backends, such as PyTorch, TensorRT, and TensorFlow, for training and inference.
A JAX/Flax-based framework for easy and scalable pre-training, fine-tuning, evaluation, and serving of large language models.
A Go library that simplifies TensorFlow's Go bindings with method chaining, automatic scoping, and type conversion.
A command-line tool for creating reproducible, container-based development environments for AI/ML workflows.
A visual workflow-based AI deployment framework for multi-platform and multi-backend inference, supporting large models and edge devices.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.