Skip to content

Commit 65db264

Browse files
authored
typos (#1491)
1 parent 8fcb00d commit 65db264

File tree

2 files changed

+5
-5
lines changed

2 files changed

+5
-5
lines changed

_blog.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2810,7 +2810,7 @@
28102810
- nlp
28112811

28122812
- local: wuertschen
2813-
title: "Introducing Würtschen: Fast Diffusion for Image Generation"
2813+
title: "Introducing Würstchen: Fast Diffusion for Image Generation"
28142814
author: dome272
28152815
thumbnail: /blog/assets/wuerstchen/thumbnail.jpg
28162816
date: September 13, 2023

wuertschen.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: "Introducing Würtschen: Fast Diffusion for Image Generation"
2+
title: "Introducing Würstchen: Fast Diffusion for Image Generation"
33
thumbnail: /blog/assets/wuerstchen/thumbnail.jpg
44
authors:
55
- user: dome272
@@ -11,14 +11,14 @@ authors:
1111
- user: pcuenq
1212
---
1313

14-
# Introducing Würtschen: Fast Diffusion for Image Generation
14+
# Introducing Würstchen: Fast Diffusion for Image Generation
1515

1616
<!-- {blog_metadata} -->
1717
<!-- {authors} -->
1818

19-
![Collage of images created with Würtschen](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/blog/wuertschen/collage_compressed.jpg)
19+
![Collage of images created with Würstchen](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/blog/wuertschen/collage_compressed.jpg)
2020

21-
## What is Würtschen?
21+
## What is Würstchen?
2222

2323
Würstchen is a diffusion model, whose text-conditional component works in a highly compressed latent space of images. Why is this important? Compressing data can reduce computational costs for both training and inference by orders of magnitude. Training on 1024×1024 images is way more expensive than training on 32×32. Usually, other works make use of a relatively small compression, in the range of 4x - 8x spatial compression. Würstchen takes this to an extreme. Through its novel design, it achieves a 42x spatial compression! This had never been seen before, because common methods fail to faithfully reconstruct detailed images after 16x spatial compression. Würstchen employs a two-stage compression, what we call Stage A and Stage B. Stage A is a VQGAN, and Stage B is a Diffusion Autoencoder (more details can be found in the  **[paper](https://arxiv.org/abs/2306.00637)**). Together Stage A and B are called the *Decoder*, because they decode the compressed images back into pixel space. A third model, Stage C, is learned in that highly compressed latent space. This training requires fractions of the compute used for current top-performing models, while also allowing cheaper and faster inference. We refer to Stage C as the *Prior*.
2424

0 commit comments

Comments
 (0)