Yu's MemoCapsule


Network Design in Stable Diffusion

StabilityAI has recently open-sourced a series of foundational models for image generation, called Stable Diffusion. Although we know these models are based on latent diffusion, few reports describe their detailed designs. To facilitate better understanding and potential future improvement, this blog provides some information about the designs of the Unet and the VAE, which are key components of the magic generation.

Unet

Fig. 1: Overview of the Unet in Stable Diffusion 1.