Home / Companies / Replicate / Blog / Post Details
Content Deep Dive

How to get the best results from Stable Diffusion 3

Blog post from Replicate

Post Details
Company
Date Published
Author
fofr
Word Count
2,100
Language
English
Hacker News Points
-
Summary

Stable Diffusion 3 (SD3), recently released by Stability AI, is a powerful text-to-image model that excels in photorealism, typography, and following detailed prompts, with its support for prompts up to 10,000 characters. SD3 provides multiple versions with different text encoder configurations to accommodate varying VRAM capacities, allowing users to select options based on their hardware capabilities. The model's key innovation lies in its ability to handle long, descriptive prompts without the limitations of previous token restrictions, though users are advised to avoid negative prompts as they do not function as expected. SD3 employs three different text encoders, and users can experiment with different prompts for each encoder to optimize image generation. The recommended settings for generating high-quality images include using 28 steps, a guidance scale (CFG) between 3.5 and 4.5, and the dpmpp_2m sampler with the sgm_uniform scheduler. The introduction of a new parameter, "shift," allows for better noise management in high-resolution images, enhancing output quality. Stability AI has also open-sourced Diffusers and ComfyUI implementations for SD3, enabling users to experiment with and customize their configurations further.