There are currently 142 open-source projects built with PyTorch, with a combined total of 2138.3k GitHub stars. The most common language among these projects is Python.
Showing 142 open-source projects · page 1 of 4
A model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks.
A model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks.
A comprehensive collection of Chinese NLP resources, datasets, tools, and pre-trained models for developers and researchers.
A latent text-to-image diffusion model that generates detailed images from text prompts, running on GPUs with at least 10GB VRAM.
A collection of 60+ annotated PyTorch implementations of deep learning papers with side-by-side explanatory notes.
A web UI and optimization library for running and fine-tuning open-source AI models locally with 2x faster training and 70% less VRAM.
A state-of-the-art PyTorch-based computer vision model for object detection, segmentation, and classification.
A cutting-edge framework for training and deploying state-of-the-art YOLO models for object detection, segmentation, classification, and pose estimation.
A deep learning toolkit for Text-to-Speech generation with pretrained models in over 1100 languages and tools for training.
A unified deep learning system for efficient large-scale model training and inference with advanced parallelism strategies.
An open platform for training, serving, and evaluating large language model based chatbots.
A transformer-based text-to-audio model that generates realistic multilingual speech, music, and sound effects.
A comprehensive collection of PyTorch image models, layers, utilities, and training scripts for computer vision research and applications.
Interactive point-based manipulation tool for editing GAN-generated images by dragging points to target positions.
A PyTorch-based platform for state-of-the-art object detection, segmentation, and visual recognition tasks.
Industrial-strength Natural Language Processing library for Python, featuring pretrained pipelines for 70+ languages and production-ready training.
A modular PyTorch library for state-of-the-art diffusion models to generate images, audio, and 3D molecular structures.
A collection of concise PyTorch tutorials for deep learning researchers, with most models implemented in under 30 lines of code.
A deep learning framework to pretrain and finetune any AI model on any hardware with zero code changes.
A deep learning framework to pretrain and finetune any AI model at any scale with zero code changes.
A PyTorch wrapper that automates engineering boilerplate for scalable AI model training and deployment.
A ready-to-use OCR Python library supporting 80+ languages and popular writing scripts like Latin, Chinese, Arabic, and Cyrillic.
A deep learning library built on PyTorch that provides high-level components for rapid results and low-level components for research flexibility.
A deep learning library built on PyTorch that provides high-level components for rapid results and low-level components for research flexibility.
A high-performance serving framework for large language models and multimodal models, delivering low-latency and high-throughput inference.
An array framework for machine learning on Apple silicon with unified memory and dynamic graph construction.
Minimal inference code for running FLUX.1 open-weight models for image generation and editing.
A unified Python library for explaining any machine learning model's predictions using Shapley values from game theory.
The fastai book, published as Jupyter Notebooks, provides an introduction to deep learning, fastai, and PyTorch.
A PyTorch library for building and training Graph Neural Networks (GNNs) on structured and irregular data.
A comprehensive library for building and training Graph Neural Networks (GNNs) with PyTorch.
A lightweight Python library for face recognition and facial attribute analysis (age, gender, emotion, race) with a unified API.
A repository of examples, utilities, and best practices for building and deploying production-ready recommendation systems.
Fast automatic speech recognition with accurate word-level timestamps and speaker diarization, built on OpenAI's Whisper.
An open-source framework for financial large language models, enabling cost-effective fine-tuning for tasks like sentiment analysis and forecasting.
An offline desktop application for transcribing and translating audio/video files, live recordings, and YouTube links using OpenAI's Whisper.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.