How to get more variation in the null image

I've been generating images using this model, which is delightfully fast, but I've noticed that it produces images that are all alike. I tried generating the "null" image by doing:

```
H = perceptor.encode_text(toks.to(device)).float()
z = net(0 * H)
```

This resulted in:

![base image](https://user-images.githubusercontent.com/17042/188748164-81fcca02-5942-4121-b038-fab93db26512.png)

And indeed, everything I generated kind of matched that: you can see the fleshly protrusion on the left in "gold coin":

![gold-coin--0 0](https://user-images.githubusercontent.com/17042/188748035-72e4044f-5c38-4474-b9cd-7f4a753da811.png)

The object and matching mini-object in "tent":

![tent-0 5](https://user-images.githubusercontent.com/17042/188748114-da6f7f6f-3b83-472f-bf52-2ae6516f044e.png)

And it always seems to try to caption the image with nonsense lettering ("lion"):
 
![lion--0 0](https://user-images.githubusercontent.com/17042/188748066-d54ddef1-1f44-4de3-9e4b-3bdd2c4d9220.png)

So I'm wondering if there's a way to "prime" the model and suggest it use a different zero image for each run. Is there a variable I can set, or is this deeply ingrained in training data? 

Any advice would be appreciated, thank you!

(Apologies if this is the same as #8, but it sounded like #8 was solved by using priors which doesn't seem to help with this.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get more variation in the null image #27

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to get more variation in the null image #27

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions