Showing 11 of 11 projects
A web UI and optimization library for running and fine-tuning open-source AI models locally with 2x faster training and 70% less VRAM.
Official inference framework for 1-bit LLMs, enabling fast and lossless CPU/GPU inference with significant speed and energy efficiency gains.
A framework for programming language models with Python instead of prompting, enabling modular AI systems with automatic prompt optimization.
A blazing-fast, lightweight deep learning inference engine from Alibaba, optimized for on-device LLMs and Edge AI.
An open-source LLMOps platform unifying gateway, observability, evaluation, optimization, and experimentation for industrial-grade LLM applications.
A curated collection of research papers on decision, classification, and regression trees with implementations from top ML conferences.
A quantization extension for Keras that provides drop-in replacement layers for creating quantized deep learning models in TensorFlow.
Automatically builds high-performance interpretable machine learning models with minimal features using a single line of code.
A Python library for fast, reproducible, and modular Neural Architecture Search (NAS) to generate efficient deep networks.
A collection of Jupyter notebooks demonstrating TensorFlow Lite model quantization, conversion, and optimization techniques for deep neural networks.
A JAX transform that implements LoRA (Low-Rank Adaptation) for efficient fine-tuning of large models with minimal memory overhead.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.