Enable Chroma finetune within 32gb RAM & 24gb VRAM #488
Hello, I was trying to finetune Chroma1-HD on my setup (RTX 3090, 32 GB RAM). However, I kept getting stuck during model loading because the RAM filled up completely, freezing my computer every time.

In the `ChromaModel` class, the safetensors file is first loaded onto the CPU so it can be quantized. The Chroma checkpoint is too large to be loaded in full precision within 32 GB of RAM, so I changed the global dtype to `bfloat16` during the initial loading and restored the normal dtypes afterwards. With this configuration, I can easily finetune Chroma on my setup.
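For illustration, here is a minimal sketch of the idea, assuming a safetensors checkpoint read tensor-by-tensor on the CPU. The function names and structure are hypothetical and not the actual `ChromaModel` loading code; the point is just that casting each tensor to `bfloat16` as it is read keeps peak RAM at roughly half of a full-precision load, and the expected dtypes can be restored per parameter afterwards:

```python
import torch
from safetensors import safe_open

def load_checkpoint_low_ram(path: str) -> dict[str, torch.Tensor]:
    """Hypothetical low-RAM loader: read tensors one at a time and cast to bfloat16."""
    state_dict: dict[str, torch.Tensor] = {}
    with safe_open(path, framework="pt", device="cpu") as f:
        for key in f.keys():
            # Cast immediately, so at most one tensor is held in its stored
            # precision at any moment instead of the whole checkpoint.
            state_dict[key] = f.get_tensor(key).to(torch.bfloat16)
    return state_dict

def restore_dtypes(state_dict: dict[str, torch.Tensor],
                   target_dtypes: dict[str, torch.dtype]) -> dict[str, torch.Tensor]:
    """Hypothetical helper: cast parameters back to the dtypes the model expects."""
    return {k: v.to(target_dtypes.get(k, v.dtype)) for k, v in state_dict.items()}
```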
I also fixed a bug related to `FakeTextEncoder` during loading (related issue: #405).