A high-throughput, memory-efficient inference and serving engine for large language models (LLMs).
A high-performance serving framework for large language models and multimodal models, delivering low-latency and high-throughput inference.
A pure Crystal machine learning library for building and training neural networks with CPU/GPU support and PyTorch compatibility.