What is the architecture of Stable Diffusion 3 Medium?
Question
Answers (1)
**Stable Diffusion 3 Medium (SD3 Medium)** uses a **Multimodal Diffusion Transformer (MMDiT)** architecture. It encodes the prompt with three text encoders (OpenCLIP-ViT/G, CLIP-ViT/L, and T5-xxl) and works in the latent space of a 16-channel VAE. Earlier Stable Diffusion releases used a 4-channel VAE, so the wider latent helps preserve fine image detail, particularly in hands and faces.
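To make the pieces concrete, here is a rough sketch of how the components fit together and what the 16-channel latent looks like for a given image size. It is not the actual implementation: the class and function names are made up for illustration, and the 8x VAE downsampling factor and 2x2 patch size are assumptions that are not stated in the answer above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SD3MediumSketch:
    """Illustrative summary of the components named in the answer."""
    backbone: str = "MMDiT"
    text_encoders: tuple = ("OpenCLIP-ViT/G", "CLIP-ViT/L", "T5-xxl")
    latent_channels: int = 16   # from the answer: 16-channel VAE
    vae_downsample: int = 8     # ASSUMPTION: spatial reduction per side
    patch_size: int = 2         # ASSUMPTION: latent patch size fed to the transformer


def latent_shape(cfg: SD3MediumSketch, height: int, width: int) -> tuple:
    """Return (channels, latent_height, latent_width) for an image size in pixels."""
    if height % cfg.vae_downsample or width % cfg.vae_downsample:
        raise ValueError("image size must be a multiple of the VAE downsampling factor")
    return (
        cfg.latent_channels,
        height // cfg.vae_downsample,
        width // cfg.vae_downsample,
    )


def image_token_count(cfg: SD3MediumSketch, height: int, width: int) -> int:
    """Number of image tokens the transformer sees after patchifying the latent."""
    _, lat_h, lat_w = latent_shape(cfg, height, width)
    if lat_h % cfg.patch_size or lat_w % cfg.patch_size:
        raise ValueError("latent size must be a multiple of the patch size")
    return (lat_h // cfg.patch_size) * (lat_w // cfg.patch_size)


cfg = SD3MediumSketch()
print(latent_shape(cfg, 1024, 1024))       # (16, 128, 128)
print(image_token_count(cfg, 1024, 1024))  # 4096
```

Under these assumptions, a 1024x1024 image becomes a 16x128x128 latent, which the transformer processes as 4096 image tokens alongside the text tokens from the three encoders.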