VAE Trainer #286

tkgix · 2023-03-11T14:46:29Z

tkgix
Mar 11, 2023

お疲れ様です.

VAE Trainerについて提案します。

ご存知のように、VAEは最終的な詳細ピクセルの表現を決定する要素です。

VAE Trainはすでにかなり前にstable diffusion 1.5開発時にコードが公開されています。
https://github.com/CompVis/latent-diffusion

個人的に改造して使っていますが、設置過程が難しくて他人にはおすすめできませんね。

このrepoにVAE Trainerを追加して公開すれば、様々な良いVAEが生まれると期待しています。

VAE訓練課程はlatentに変換して復元することを繰り返す単純な過程なので、適切なサイズの画像だけを入れて回すのでOKです。
私の経験ではFine-tuningまたはLORAトレーニングに使用したデータセットをそのまま使用して、そのモデル専用のVAEを作ることでもかなりの効果があります。モデルが表現できない非常に狭い領域の表現を手伝ってくれます。

または、finetuningの際にVAEも一緒にトレーニングする方法も考えられます。

考慮してみてください。
いつも応援しています。それでは。

P.S: 別の話ですが、metadataファイルを制作する過程なしにFine-tuningができればと思います。多分このrepoの一番大きな短所ではないかと思います(笑)

idlebg · 2023-03-30T12:49:44Z

idlebg
Mar 30, 2023

I am already doing this with the public code.
But +1 if we can have this here too 🤟 🥃

1 reply

arcanite24 Nov 24, 2023

Hey @idlebg, are you fine tuning the VAE with the Latent Diffusion code base?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

VAE Trainer #286

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

VAE Trainer #286

Uh oh!

Uh oh!

tkgix Mar 11, 2023

Replies: 1 comment · 1 reply

Uh oh!

idlebg Mar 30, 2023

Uh oh!

arcanite24 Nov 24, 2023

tkgix
Mar 11, 2023

Replies: 1 comment 1 reply

idlebg
Mar 30, 2023