Model Optimization

15 projects

Showing 15 of 15 projects

A web UI and optimization library for running and fine-tuning open-source AI models locally with 2x faster training and 70% less VRAM.

#multi-modal-ai#llama3#gpu-acceleration

Stars68.4k

Forks6.2k

Last commit5 hours ago

bitnet.cppC++

Official inference framework for 1-bit LLMs, enabling fast and lossless CPU/GPU inference with significant speed and energy efficiency gains.

#gpu-inference#transformer-models#cpu-inference

A framework for programming language models with Python instead of prompting, enabling modular AI systems with automatic prompt optimization.

#declarative-programming#ai-framework#python-library

A blazing-fast, lightweight deep learning inference engine from Alibaba, optimized for on-device LLMs and Edge AI.

#vulkan#winograd-algorithm#neural-network

An open-source LLMOps platform unifying gateway, observability, evaluation, optimization, and experimentation for industrial-grade LLM applications.

#ai-gateway#ai#deep-learning

Stars11.7k

Forks949

Last commit1 month ago

Awesome Decision Tree PapersPython

A curated collection of research papers on decision, classification, and regression trees with implementations from top ML conferences.

#random-forest#decision-tree-classifier#ensemble-methods

A quantization extension for Keras that provides drop-in replacement layers for creating quantized deep learning models in TensorFlow.

#fpga#deep-learning#asic-design

Stars584

Forks109

Last commit4 months ago

Auto_ViMLPython

Automatically builds high-performance interpretable machine learning models with minimal features using a single line of code.

#data-cleaning#imbalanced-data#feature-selection

Stars548

Forks104

Last commit1 year ago

ArchaiPython

A Python library for fast, reproducible, and modular Neural Architecture Search (NAS) to generate efficient deep networks.

#deep-learning#automl#petridish

Stars485

Forks93

Last commit7 months ago

Adventures in TensorFlow LiteJupyter Notebook

A collection of Jupyter notebooks demonstrating TensorFlow Lite model quantization, conversion, and optimization techniques for deep neural networks.

#model-quantization#model-conversion#on-device-ml

A JAX transform that implements LoRA (Low-Rank Adaptation) for efficient fine-tuning of large models with minimal memory overhead.

#parameter-efficient-training#jax#deep-learning

Stars143

Forks6

Last commit2 years ago

fewPython

A feature engineering wrapper for scikit-learn that uses genetic programming to find optimal feature transformations for machine learning models.

#data-science#python#feature-engineering

Stars53

Forks17

Last commit6 years ago

Keras GPT CopilotPython

A Python package that integrates an LLM copilot into Keras model development to provide performance improvement suggestions.

#llm-copilot#skynet#deep-learning

Stars28

Forks2

Last commit2 years ago

Dynamic Capacity NetworksPython

A TensorFlow implementation of Dynamic Capacity Networks, which reduces computations by applying high-capacity networks to selected input patches.

#computational-efficiency#deep-learning#dynamic-capacity-networks

A dynamic inference wrapper for Transformer language models that enables per-token layer skipping to reduce computational FLOPs.

#autoregressive-models#layer-skipping#kv-cache

Stars2

Forks0

Last commit7 months ago

Related Tags

Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a project Star on GitHub