A latent text-to-image diffusion model that generates detailed images from text prompts, running on GPUs with at least 10GB VRAM.
Stable Diffusion is an open-source latent text-to-image diffusion model that generates detailed images from natural language prompts. It achieves high-quality image synthesis efficiently by operating in a compressed latent space rather than in pixel space, which makes it runnable on GPUs with as little as 10GB of VRAM. The model is conditioned on text embeddings from a frozen CLIP ViT-L/14 encoder and is trained on subsets of the LAION-5B dataset.
AI researchers, machine learning engineers, and developers working on generative art, creative tools, or applications requiring text-to-image synthesis. It's also suitable for hobbyists with capable GPU hardware.
Developers choose Stable Diffusion for its open-source nature, relatively lightweight architecture compared to alternatives like Imagen, and strong community support through integrations like diffusers. It offers a balance of high-quality output and computational efficiency.
Stable Diffusion is an open-source alternative to the following products: