feat: add icon and description for Stable Diffusion benchmark#917
feat: add icon and description for Stable Diffusion benchmark#917anhappdev merged 5 commits intosubmission-v4.1from
Conversation
|
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅ |
|
|
@AhmedTElthakeb please report number of parameters and FLOPs of the 3 models we use. |
|
bc50b66 to
be59d3c
Compare
|
|
@Mostelk Please provide a description for the Stable Diffusion benchmark. |
Please check this description, we reviewed it in the Wed meeting The Text to Image Gen AI benchmark adopts Stable Diffusion v1.5 for generating images from text prompts. It is a latent diffusion model. The benchmarked Stable Diffusion v1.5 refers to a specific configuration of the model architecture that uses a downsampling-factor 8 autoencoder with an 860M UNet,123M CLIP ViT-L/14 text encoder for the diffusion model, and VAE Decoder of 49.5M parameters. The model was trained on 595k steps at resolution of 512x512, which enables it to generate high quality images. We refer you to https://huggingface.co/benjamin-paine/stable-diffusion-v1-5 for more information. The benchmark runs 20 denoising steps for inference, and uses a precalculated time embedding of size 1x1280. Reference models can be found here https://github.com/mlcommons/mobile_open/releases |
be59d3c to
063c086
Compare
063c086 to
bc671d0
Compare
|



Uh oh!
There was an error while loading. Please reload this page.